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SFJ .ErTTON OF PROTEINS 
5 TTSTNG RNA-PROTFTN FUSIONS 

Background of the Invention 
This invention relates to protein selection methods. 
The invention was made with government support under grant 
F32 GMl 7776-01 and F32 GM 17776-02. The government has certain rights in the 
10 invention. 

Methods currently exist for the isolation of RNA and DNA molecules 
based on their functions. For example, experiments of Ellington and Szostak (Nature 
346:818 (1990); and Nature 355:850 (1992)) and Tuerk and Gold (Science 249:505 

(1990) ; and J. Mol. Biol 222:739 (1991) ) have demonstrated that very rare (i.e., less 
15 than 1 in 10'^) nucleic acid molecules with desired properties may be isolated out of 

complex pools of molecules by repeated rounds of selection and amplification. These 
methods offer advantages over traditional genetic selections in that (i) very large 
candidate pools may be screened ( > 10*0, (iO host viability and in vivo conditions are 
not concerns, and (iii) selections may be carried out even if an in vivo genetic screen 
20 does not exist. The power of in vitro selection has been demonstrated in defining 

novel RNA and DNA sequences with very specific protein binding functions (see, for 
example, Tuerk and Gold, Science 249:505 (1990); Irvine et al., J. Mol. Biol 222:739 

(1991) ; Oliphant et al., Mol. Cell Biol. 9:2944 (1989); Blackwell et al., Science 
250:1 104 (1990); Pollock and Treisman, Nuc. Acids Res. 18:6197 (1990); Thiesen 

25 and Bach, Nuc. Acids Res. 18:3203 (1990); Bartel et al.. Cell 57:529 (1991); Stormo 
and Yoshioka, Proc. Natl. Acad. Sci. USA 88:5699 (1991); and Bock et al.. Nature 
355:564 (1992)), small molecule binding functions (Ellington and Szostak, Nature 
346:818 (1990); Ellington and Szostak, Nature 355:850 (1992)), and catalytic 
functions (Green et al.. Nature 347:406 (1990); Robertson and Joyce, Nature 344:467 

30 (1990); Beaudry and Joyce, Science 257:635 (1992); Bartel and Szostak, Science 
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261:1411 (1993); Lorsch and Szostak, Nature 371:31-36 (1994); Cuenoud and 
Szostak, Nature 375:61 1-614 (1995); Chapman and Szostak, Chemistry and Biology 
2:325-333 (1995); and Lohse and Szostak, Nature 381.442-444 (1996)). A similar 
scheme for the selection and ampUfication of proteins has not been demonstrated. 

5 Summary o f tha Invention 

The purpose of the present invention is to allow the principles of in jdUfi 
selection and in yitm evolution to be applied to proteins. The invention facilitates the 
isolation of proteins with desired properties from large pools of partially or 
completely random amino acid sequences. In addition, the invaition solves the 
1 0 problem of recovering and amplifying the protein sequence information by covalently 
attaching the mRNA coding sequence to the protein molecule. 

In general, the inventive method consists of an in JOtEQ or in Situ 
transcription/ translation protocol that generates protein covalently Unked to the 3' end 
of its own mRNA, i.e., an RNA-protein fiision. This is accomplished by synthesis 
15 and iaJdIIfiorinsitll translation of anmRNAmoleculewithapeptide acceptor 
attached to its 3' end. One prefened peptide acceptor is puromycin. a nucleoside 
analog that adds to the C-terminus of a growing peptide chain and terminates 
translation. In one preferred design, a DNA sequence is included between the end of 
the message and the peptide acceptor which is designed to cause the ribosome to 
20 pause at the end of the open reading frame, providing additional time for the peptide 
acceptor (for example, puromycin) to accept the nascent peptide chain before 
hydrolysis of the peptidyl-tRNA linkage. 

If desired, the resulting RNA-protein fiision allows repeated rounds of 
selection and amplification because the protein sequence information may be 
25 recovered by reverse transcription and amplification (for example, by PCR 

amplification as well as any other ampUfication technique, including RNA-based 
amplification techniques such as 3SR or TS A). The amplified nucleic acid may then 
be transcribed, modified, and in jdfifi or in alB translated to generate mRNA-protein 
fusions for the next round of selection. The ability to carry out multiple rounds of 
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selection and amplification enables the enrichment and isolation of very rare 
molecules, e.g., one desired molecule out of a pool of 10'^ members. This in turn 
allows the isolation of new or improved proteins which specifically recognize 
virtually any target or which catalyze desired chemical reactions. 



5 Accordingly, in a first aspect, the invention features a method for selection 

of a desired protein, involving the steps of: (a) providing a population of candidate 
RNA molecules, each of which includes a translation initiation sequence and a start 
codon operably linked to a candidate protein coding sequence and each of which is 
operably linked to a peptide acceptor at the 3* end of the candidate protein coding 

10 sequence; (b) in vitro or in situ translating the candidate protein coding sequences to 
produce a population of candidate RNA-protein fusions; and (c) selecting a desired 
RNA-protein fiision, thereby selecting the desired protein. 

In a related aspect, the invention features a method for selection of a DNA 
molecule which encodes a desired protein, involving the steps of: (a) providing a 

1 5 population of candidate RNA molecules, each of which includes a translation 

initiation sequence and a start codon operably linked to a candidate protein coding 
sequence and each of which is operably linked to a peptide acceptor at the 3' end of 
the candidate protein coding sequence; (b) in vitro or in situ translating the candidate 
protein coding sequences to produce a population of candidate RNA-protein fusions; 

20 (c) selecting a desired RNA-protein fusion; and (d) generating from the RNA portion 
of the fusion a DNA molecule which encodes the desired protein. 

In another related aspect, the invention features a method for selection of a 
protein having an altered function relative to a reference protein, involving the steps 
of: (a) producing a population of candidate RNA molecules from a population of 

25 DNA templates, the candidate DNA templates each having a candidate protein coding 
sequence which differs firom the reference protein coding sequence, the RNA 
molecules each comprising a translation initiation sequence and a start codon operably 
Unked to the candidate protein coding sequence and each being operably linked to a 
peptide acceptor at the 3' end; (b) in vitro or in situ translating the candidate protein 
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coding sequences to produce a population of candidate RNA-protein fusions; and (c) 
selecting an RNA-protein fusion having an altered function, thereby selectmg the 
protein having the altered function. 

In yet another related aspect, the invention features a method for selection 
5 of aDNAmoleculewhichencodesapK>temhavinganalteredfunctionrelativetoa 

reference protein, involving the steps of: (a) producing apopulation of candidate RNA 
molecules from a population of candidate DNA templates, the candidate DNA 
templates each having a candidate protein coding sequence which differs from the 
reference protein coding sequence, the RNA molecules each comprising a translation 
10 initiation sequence and a start codon operably linked to the candidate protein codmg 
sequence and each being operably linked to a peptide acceptor at the 3' end; (b) in 
vitro or in Situ translating the candidate protein coding sequences to produce a 
population of RNA-protein fusions; (e) selecting an RNA-protein fusion having an 
altered function; and (d) generating from the RNA portion of the fixsion a DNA 
1 5 molecule which encodes the protein having the altered function. 

In yet another related aspect, the invention features a method for selection 
of a desired RNA, involving the steps of: (a) providing apopulation of candidate 
RNA molecules, each of which includes a translation initiation sequence and a start 
codon operably linked to a candidate protein coding sequence and each of which is 
20 operably linked to a peptide acceptor at the 3' end of the candidate protein coding 
sequence; (b)mvitmormsim translating the candidateprotein coding s^^^^^ 

produce a population of candidate RNA-protein fiisions; and (c) selecting a desired 
RNA-protein fusion, tiiereby selecting the desired RNA. 

In preferred embodiments of the above methods, tiie peptide acceptor is 
25 puromycin; each of the candidate RNA molecules further includes a pause sequence 
or further includes a DNA or DNA analog sequence covalentiy bonded to the 3' end of 
the RNA; the population of candidate RNA molecules includes at least 10', 
preferably, at least lO-. more preferably, at least 10". 10-, or 10", and, most 
preferably, at least 10- different RNA molecules; the midim translation reaction is 
30 carried out in a lysate prepared from a eukaryotic cell or portion thereof (and is, for 
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cxample, carried out in a reticulocyte lysate or wheat germ lysate); the in vitro 
translation reaction is carried out in an extract prepared from a prokaryotic cell (for 
example, R coli^ or portion thereof; the selection step involves binding of the desired 
protein to an immobilized binding partner; the selection step involves assaying for a 
5 functional activity of the desired protein; the DNA molecule is ampHfied; the method 
further involves repeating the steps of the above selection methods; the method 
further involves transcribing an RNA molecule from the DNA molecule and repeating 
steps (a) through (d); following the in vitro translating step, the method further 
involves an incubation step carried out in the presence of 50-100 mM Mg^^; and the 

1 0 RNA-protein fusion further includes a nucleic acid or nucleic acid analog sequence 
positioned proximal to the peptide acceptor which increases flexibility. 

In other related aspects, the invention features an RNA-protein fusion 
selected by any of the methods of the invention; a ribonucleic acid covalently bonded 
though an amide bond to an amino acid sequence, the amino acid sequence being 

15 encoded by the ribonucleic acid; and a ribonucleic acid which includes a translation 
initiation sequence and a start codon operably linked to a candidate protein coding 
sequence, the ribonucleic acid being operably linked to a peptide acceptor (for 
example, puromycin) at the 3' end of the candidate protein coding sequence. 

In a second aspect, the invention features a method for selection of a 

20 desired protein or desired RNA through enrichment of a sequence pool. This method 
involves the steps of: (a) providing a population of candidate RNA molecules, each 
of which includes a translation initiation sequence and a start codon operably linked 
to a candidate protein coding sequence and each of which is operably linked to a 
peptide acceptor at the 3' end of the candidate protein coding sequence; (b) in vitro or 

25 in situ translating the candidate protein coding sequences to produce a population of 
candidate RNA-protein fusions; (c) contacting the population of RNA-protein fusions 
with a binding partner specific for either the RNA portion or the protein portion of the 
RNA-protein fusion under conditions which substantially separate the binding 
partner-RNA-protein fusion complexes from unbound members of the population; (d) 

30 releasing the bound RNA-protein fusions from the complexes; and (e) contacting the 
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p„p„U.io„ of RNA-proteu, «o„s ft»m -ep (d) wift abinding p.«ner specific for 
tt.cpro.e,„ponionofacd«ucdRNA-protein<usionundcrcondito»h.ch 

.„b.un«y sepa«.e .he binding p.r»er-RNA-pro.cin «cncomp.ex fton. unbound 
„,e™bers of .aid population, d,.reby sel^^Ung U,e de^«d pn,«in and «,e dcned 

5 RNA 

,n prefeired en*odime„K the medwd further involvea repeating steps (a) 

U„,ugl, (e). in addition, for ti»se repeated step^ «.e s«ne or different binding 
partners may be used, in any order, for selective enrichment of the desired RNA- 
p„,.inf>.sion. lnano.h«rprefe,redembodiment.step(d)invoWesfl.eus.ofa 
,0 bindingpa,.ner<for«c,mp.e.an,onoo,on.lantibody,sp.cificforti,ep..t^npor.on 
ofthedesiredfi,sion.Thiss.epiaprefer,blyc..riedo«tf611owingreverse 
«tionofth.BNAportionof«.e*.iontoge„era.eaDNAwhiche„c„d«,^ 
aesired protein. ,f desired, this DNA may be isola«i and/or PCRanrplUled^ Tb,s 
enrichment technique n»y be used ti.selecade»red protein or may be used to select 
,5 sproteinhavinganalteredfimctionrelativotoareferenceprotem. 

1„ oti»r pref»r«l embodiments of the enrichment methods, the peptide 
„cep.orisp»«mycin;eachof.hecandida.eimAmole»,es«.erincl„desapause 

se^u^^ceorS^titerinch.desaDNAorDHAanalogse^ue^covalently^"-''^'" 
2 y end of the RNA; the population of candidate RNA molecules .ucludes a. leas. 

preferably, a. least 10", more preferably, a. least lO". 10». or lO", and, mos. 
preferably,.. lea^lO-differen. RNA molecules; ti-in^transUtion reaction ,s 

cried out in a lysate prepare! Som a eukaryotic cell or portion d,e^f (and ,s. fbr 

example, carried on. in a reticulocyte lysate or wheat gem. lysate); .he m SUm 
^anslationreactioniscarriedoutinanextiactpreparedfiomaprokaryoticcellor 

2S p„rtion«,ereof(forexamp.e,l.^-.theDNAmolecu.eis»„plified; atleastoneof 
ti,e binding partners is immobilized on a soUd support; following ti,e in Bl^ 
^.lating step, «.e method further involves an incubation step carried ou. m the 
presence of 50-100 mM Mg- and the RNA-protein fusion further includes a nudcc 
acid or nucWc acid analog .e,u«,ce positioned proximal ti. «» peptide acceptor 
30 which increases flexibility. 
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In a related aspect, the invention features kits for carrying out any of the 
selection methods described herein. 

In a third and final aspect, the invention features a microchip that includes 
an array of immobilized single-stranded nucleic acids, the nucleic acids being 
5 hybridized to RNA-protein fusions. Preferably, the protein component of the RNA- 
protein fusion is encoded by the RNA. 

As used herein, by a "population" is meant more than one molecule (for 
example, more than one RNA, DNA, or RNA-protein fusion molecule). Because the 
methods of the invention facilitate selections which begin, if desired, with large 

1 0 numbers of candidate molecules, a "population" according to the invention preferably 
means more than 10^ molecules, more preferably, more than 10", 10", or 10*^ 
molecules, and, most preferably, more than 10'^ molecules. 

By "selecting" is meant substantially partitioning a molecule from other 
molecules in a population. As used herein, a "selecting" step provides at least a 2- 

1 5 fold, preferably, a 30-fold, more preferably, a 100-fold, and, most preferably, a 1000- 
fold enrichment of a desired molecule relative to undesired molecules in a population 
following the selection step. As indicated herein, a selection step may be repeated 
any number of times, and different types of selection steps may be combined in a 
given approach. 

20 By a **protein" is meant any two or more naturally occurring or modified 

amino acids joined by one or more peptide bonds. "Protein" and "peptide" are used 
interchangeably herein. 

By "RNA" is meant a sequence of two or more covalently bonded, 
naturally occurring or modified ribonucleotides. One example of a modified RNA 
25 included within this term is phosphorothioate RNA. 

By a "translation initiation sequence" is meant any sequence which is 
capable of providing a functional ribosome entry site. In bacterial systems, this 
region is sometimes referred to as a Shine-Dalgamo sequence. 

By a "start codon" is meant three bases which signal the beginning of a 



PCT/US98/00807 

WO 98/31700 

-8- 

p„,ei„ coding scuence. G»»a«y. tt.es« bases »e AUG (or ATG); h„wev». 
„,h« base mplet capable otbeing uSlized in this manner may be subsututed. 

By "covalently bonded" to a peptide acceptor is meant that the pept.de 
acceptor .s.oined to a "proton coding s^^' either directly tht^gh a cov^».. 
5 bond or indirectly through another covalentty b««ied sequence (for example. DNA 
corresponding to a pause site). .j,,„ 
By a "peptide acceptor" is meant an, molecule capable of betng added to 
d,e C-tcrminus of a growing pro,«n chainby the catalytic acH,ity of theribosomal 
peptidyl transferase function. Typically, such molecules cont^n (i) a nuclcottde or 
,0 nucleotide-like moiety (for ex^nple. adenosine or an adenosine analog (d,- 

„ethylation at the N-6 amino positi™, is acceptable)), (ii) «. amino acid or amtno 
acid.,il.emoicty(forex«nple.anyof.he20D-orL.aminoacidsoranyannno^d 

analog th^eof (for example. Omethyl tyrosine or any of the analogs descrtbed by 
. Elhn»e.a,..Meth.Eozymol.2.2:30.,.991).and(iii)alin.ageb.™e»thetwof^ 

15 example, an ester, amide, or ketone lin^c a. the 3' position or. less ^^"•^'^^^ 
position,; prefe^bly.thishnkage does not signi.ican.lype.turb the pudrerofthenng 

ftom the namral ,ib»,»cleoade conformation. Peptide acceptors may also possess a 
nucleoplule. Which ma, be. Without limitation, an amino gro»p..hydroxyl group, or 

asulfl,ydrylg~up.h,».dition,peptide«ceptorsm.ybecompo^ofn„cleot.de 

mimedcs. amino acid mimetics.ormimeticsot.hecombinednucleot.de-am.noac.d 



20 

structure 



25 



30 



By a peptide accq>tor being positional "a. the 3' end" of a ptotein coding 
sequence is meant that the peptide acceptor molecule is positions, after the final 
codon of that protein coding s«,u»«e. This t™. inctadcs. without limitabon. a 
peptide acceptor molecule that is positioned precisely at the end of the proton 
crdings«,uenceasw.llasonewhichissepa,a.».ftom.be«nalcodonbym,e,ven,ng 

coding or non^ing sequence (for example, a sequence corresponding to a pause 
site) This t»m also includes constmcts in which coding or non-coding s«p.ences 
follow (that is. are 3' to) the peptide acceptor molecule. In addition. th.s lem, 
encompasses, without hmitation, a peptide ««ept»r molecule that is covalentty 
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bonded (either directly or indirectly through intervening nucleic acid sequence) to the 
protein coding sequence, as well as one that is joined to the protein coding sequence 
by some non-covalent means, for example, through hybridization using a second 
nucleic acid sequence that binds at or near the 3' end of the protein coding sequence 

5 and that itself is bound to a peptide acceptor molecule. 

By an "altered function" is meant any qualitative or quantitative change in 
the function of a molecule. 

By a '*pause sequence" is meant a nucleic acid sequence which causes a 
ribosome to slow or stop its rate of translation. 

10 By "binding partner," as used herein, is meant any molecule which has a 

specific, covalent or non-covalent affinity for a portion of a desired RNA-protein 
fusion. Examples of binding partners include, without limitation, members of 
antigen/antibody pairs, protein/inhibitor pairs, receptor/ligand pairs (for example cell 
surface receptor/ligand pairs, such as hormone receptor/peptide hormone pairs), 

1 5 enzyme/substrate pairs (for example, kinase/substrate pairs), lectin/carbohydrate pairs, 
oligomeric or heterooligomeric protein aggregates, DNA binding protein/DNA 
binding site pairs, RNA/protein pairs, and nucleic acid duplexes, heleroduplexes, or 
ligated strands, as well as any molecule which is capable of forming one or more 
covalent or non-covalent bonds (for example, disulfide bonds) with any portion of an 

20 RNA-protein fusion. Binding partners include, without limitation, any of the 
"selection motifs" presented in Figure 2. 

By a "solid support" is meant, without limitation, any column (or column 
material), bead, test tube, microtiter dish, solid particle (for example, agarose or 
sepharose), microchip (for example, silicon, silicon-glass, or gold chip), or membrane 

25 (for example, the membrane of a liposome or vesicle) to which an affinity complex 

may be bound, either directly or indirectly (for example, through other binding partner 
intermediates such as other antibodies or Protein A), or in which an affinity complex 
may be embedded (for example, through a receptor or channel). 

The presently claimed invention provides a number of significant 

30 advantages. To begin with, it is the first example of this type of scheme for the 
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selection and amplification of proteins. This technique overcomes the impasse 
created by the need to recover nucleotide sequences conresponding to desired, isolated 
proteins (since only nucleic acids can be repUcated). In particular, many prior 
methods that allowed the isolation of proteins from partially or folly randomized 
5 pools did so through an insdya step. Methods of this sort include monoclonal 

antibody technology (Milstein, Sci. Amer. 243:66 (1980); and Schultz et al., J. Chem. 
Engng. News 68:26 (1990)), phage display (Smith, Science 228:1315 (1985); Parmley 
and Smith, Gene 73:305 (1988); and McCaflFerty et al.. Nature 348:552 (1990)), 
peptide-lac repressor fosions (Cull et al.. Proc. Natl. Acad. Sci. USA 89:1865 (1992)), 
10 and classical genetic selections. Unlike the present technique, each of these methods 
relies on a topological Unk between the protein and the nucleic acid so that the 
information of the protein is retained and can be recovered in readable, nucleic acid 
form. 

In addition, the present invention provides advantages over the stalled 
1 5 translation method (Tuerk and Gold, Science 249:505 (1990); kvine et al., J. Mol. 
Biol 222:739 (1991); Korman et al., Proc. Natl. Acad. Sci. USA 79:1844-1848 
(1982); Mattheakis et al., Proc. Natl. Acad. Sci. USA 91:9022-9026 (1994); 
Mattheakis et al., Meth. Enzymol. 267:1 95 (1996); and Hanes and Pluckthun, Proc. 
Natl Acad Sci. USA 94:4937 (1997)). a technique in which selection is for some 
20 property of a nascent protein chain that is still complexed with the ribosome and its 
mRNA. Unlike the stalled translation technique, the present method does not rely on 
maintaining the integrity of an mRNA: ribosome: nascent chain ternary complex, a 
complex that is very fragile and is therefore limiting with respect to the types of 
selections which are technically feasible. 
25 The present method also provides advantages over the branched synthesis 

approach proposed by Brem,er and Lemer (Proc. Natl. Acad. Sci. USA 89:5381-5383 
(1992)), in which DNA-peptide fosions are generated, and g^etic information is 
theoretically recovered following one round of selection. Unlike the branched 
synthesis approach, the present method does not require the regeneration of a peptide 
30 from the DN A portion of a fosion (which, in the branched synthesis approach, is 
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generally accomplished by individual rounds of chemical synthesis). Accordingly, 
the present method allows for repeated rounds of selection using populations of 
candidate molecules. In addition, unlike the branched synthesis technique, which is 
generally limited to the selection of fairly short sequences, the present method is 

5 appHcable to the selection of protein molecules of considerable length. 

In yet another advantage, the present selection and directed evolution 
technique can make use of very large and complex Hbraries of candidate sequences. 
In contrast, existing protein selection methods which rely on an jn vivo step are 
typically limited to relatively small libraries of somewhat limited complexity. This 

1 0 advantage is particularly important when selecting functional protein sequences 

considering, for example, that 10*^ possible sequences exist for a peptide of only 10 
amino acids .in length. In classical genetic techniques, lac repressor fusion 
approaches, and phage display methods, maximum complexities generally fall orders 
of magnitude below 10'^ members. Large Hbrary size also provides an advantage for 

1 5 directed evolution applications, in that sequence space can be explored to a greater 
depth around any given starting sequence. 

The present technique also differs from prior approaches in that the 
selection step is context-independent. In many other selection schemes, the context in 
which, for example, an expressed protein is present can profoundly influence tiie 

20 nature of the library generated. For example, an expressed protein may not be 
properly expressed in a particular system or may not be properly displayed (for 
example, on the surface of a phage particle). Alternatively, the expression of a protein 
may actually interfere with one or more critical steps in a selection cycle, e.g., phage 
viability or infectivity, or lac repressor binding. These problems can result in the loss 

25 of functional molecules or in limitations on the nature of the selection procedures that 
may be applied. 

Finally, the present method is advantageous because it provides control 
over the repertoire of proteins that may be tested. In certain techniques (for example, 
antibody selection), there exists little or no control over the nature of the starting pool. 
30 In yet other techniques (for example, lac fusions and phage display), the candidate 
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pool must be expressed in the context of a fusion protein. In contrast, RNA-protein 
fusion constructs provide control over the nature of the candidate pools available for 
screening. In addition, the candidate pool size has the potential to be as high as RNA 
or DNA pools (~ 10- members), Umited only by the size of the in jdlEQ translation 

5 reaction performed. And the makeup of the candidate pool depends completely on 
experimental design; random regions may be screened in isolation or within the 
context of a desired fusion protein, and most if not all possible sequences may be 
expressed in candidate pools of RNA-protein fusions. 

Other features and advantages of the invention will be apparent from the 

10 following detailed description, and from the claims. 

"^tfli!'?^1 r>ftscriorion 
The drawings will first briefly be described. 

Pr^^f n^.;;i-iptioTi thT T^'^wines 
FIGURES lA-lC are schematic representations of steps involved in the 
pn>duction of RNA-protein fusions. Figure 1 A illustrates a sample DNA construct for 
generation of an RNAportion of a fusion. Figure IB illustrates the generation of an 
RNA/puromycin conjugate. And Figure IC illustrates the generation of an RNA- 
protein fusion. 

HGURE 2 is a schematic representation of a generalized selection 

protocol according to the invention. 

FIGURE 3 is a schematic representation of a synthesis protocol for 
minimal translation templates containing 3' puromycin. Step (A) shows the addition 
of protective groups to tiie reactive functional groups on puromycin (S'-OH and NH.); 
as modified, these groups are suitably protected for use in phosphoramidite based 
oligonucleotide syntiiesis. The protected puromycin was attached to ammohexyl 
controlled pore glass (CPG) Oirough tiie 2'OH group using the standard protocol for 
attachment of DNA through its 3-OH (Gait. Oligonucleotide Synti^esis. A Practical 
Approach, The Practical Approach Series (IRL Press. Oxford, 1984)). In step (B), a 
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minimal translation template (tenned "43-P"), which contained 43 nucleotides, was 
synthesized using standard RNA and DNA chemistry (Millippre, Bedford, MA), 
deprotected using NH4OH and TBAF, and gel purified. The template contained 13 
bases of RNA at the 5' end followed by 29 bases of DNA attached to the 3' puromycin 

5 at its 5' OH. The RNA sequence contained (i) a Shine-Dalgamo consensus sequence 
complementary to five bases of 16S rRNA (Stormo et al., Nucleic Acids Research 
10:2971-2996 (1982); Shine and Dalgamo, Proc. Natl. Acad. Sci. USA 71:1342-1346 
(1974); and Steitz and Jakes, Proc. Natl. Acad. Sci. USA 72:4734-4738 (1975)), (ii) a 
five base spacer, and (iii) a single AUG start codon. The DNA sequence was 

1 0 dA27dCdCP, where "P" is puromycin. 

FIGURE 4 is a schematic representation of a preferred method for the 
preparation, of protected CPG-linked puromycin. 

FIGURE 5 is a schematic representation showing possible modes of 
methionine incorporation into a template of the invention. As shown in reaction (A), 

1 5 the template binds the ribosome, allowing formation of the 70S initiation complex. 
Fmet tRNA binds to the P site and is base paired to the template. The puromycin at 
the 3' end of the template enters the A site in an intramolecular fashion and forms an 
amide linkage to N-fomiyl methionine via the peptidyl transferase center, thereby 
deacylating the tRNA. Phenol/chloroform extraction of the reaction yields the 

20 template with methionine covalently attached. Shown in reaction (B) is an undesired 
intermolecular reaction of the template with puromycin containing oligonucleotides. 
As before, the minimal template stimulates formation of the 70S ribosome containing 
finet tRNA bound to the P site. This is followed by entry of a second template in 
trans to give a covalently attached methionine. 

25 FIGURES 6A-6H are photographs showing the incorporation of ^'S 

methionine (^*S met) into translation templates. Figure 6A demonstrates magnesium 
(Mg^*) dependence of the reaction. Figure 6B demonstrates base stability of the 
product; the change in mobility shown in this figure corresponds to a loss of the 5' 
RNA sequence of 43-P (also tenned "Met template") to produce the DNA-puromycin 

30 portion, tenned 30-P. The retention of the label following base treatment was 
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consistent with the fonnation of a peptide bond between S methionine and the 3' 
puromycin of the template. Figure 6C demonstrates the inhibition of product 
formationin the presenceofpeptidyl transferase inhibitors. Figure 6D demonstrates 
the dependence of «S methionine incorporation on a template coding sequence. 
5 Figure 6E demonstrates DNA template length dependence of -S methionme 

incorporation. Figure 6F illustrates cis versus trans product formation using templates 
43-P and 25-P. Figure 6G illustrates cis versus trans product formation usmg 
templates 43-P and 13-P. Figure 6H illustrates cis versus trans product formation 
using templates 43-P and 30-P in a reticulocyte lysate system. 

FIGURES 7A-7C are schematic illustrations of constructs for testing 
peptide fusion formation and selection. Figure 7A shows LP77 Oigated-product," 
"77" nucleotides longXalso termed. "shortmyctemplate'MSEQID NO: 1). 
sequencecontainsthec-myc monoclonal antibody epitope tagEQKLISEEDL(SEQ 

ID NO- 2) (Evan et al.. Mol. Cell Biol. 5:3610-3616 (1985)) flanked by a 5' start 
codon and a 3' linker. Hie 5' region contains a bacterial Shine-Dalgamo sequence 
identical to that of 43-P. The coding sequence was optimized for translaUon m 
bacterial systems. In particular, the 5' UTOs of 43-P and LP77 contained a 
Shine-Dalgamo sequence complementary to fivebasesofl6SrRNA(Steitz and 

Jakes Proc Natl. Acad. Sci. USA 72:4734-1738 (1975)) and spaced similarly to 
, ::2Llproteinsequences(Stormoetal,NucleicAcidsKes..0:297. 

Figure 7B shows LPl 54 (ligated product, 154 nucleotides long) (also termed long 
n.yc template") (SEQ ID NO:3). This sequence^ontains the code for generation of 

the peptide used to isolate the c-myc antibody. 5' end contains a truncated 
versionofthe mv upstream sequence(designated"TE).™s 5- urn contameda 

>5 22nucleotidesequencederivedfromtheTMV5'UTRencompassingtwo 
' ACAAAUUAC direct repeats (Gallic et al., Nucl. Acids Res. 16.883 (1988)). Figure 
7C shows Pool #1 (SBQ ID NO: 4), an exemplary sequence to be used for peptide 
selection. THefmal seven amino acids from the original mycpeptide were includedm 

the template to serve as the 3' constant region required for PGR amplification of the 
30 template, ^s sequence is known not to be part of the antibody binding epitope. 
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FIGURE 8 is a photograph demonstrating the S3aithesis of RNA-protein 
fusions using templates 43-P, LP77, and LP 154, and reticulocyte ("Retic") and wheat 
germ ("Wheat") translation systems. The left half of the figure illustrates ^^S 
methionine incorporation in each of the three templates. The right half of the figure 
5 illustrates the resulting products after RNase A treatment of each of the three 

templates to remove the RNA coding region; shown are ^^S methionine-labeled DNA- 
protein fusions. The DN A portion of each was identical to the oligo 30-P. Thus, 
differences in mobility were proportional to the length of the coding regions, 
consistent with the existence of proteins of different length in each case. 

1 0 FIGURE 9 is a photograph demonstrating protease sensitivity of an RNA- 

protein fusion synthesized from LP 154 and analyzed by denaturing polyacrylamide 
gel electrophoresis. Lane 1 contains ^^P labeled 30-P. Lanes 2-4, 5-7, and 8-10 
contain the ^^S labeled translation templates recovered from reticulocyte lysate 
reactions either without treatment, with RNase A treatment, or with RNase A and 

1 5 proteinase K treatment, respectively. 

FIGURE 10 is a photograph showing the results of inmnmoprecipitation 
reactions using in vitro translated 33 amino acid myc-epitope protein. Lanes 1 and 2 
show the translation products of the myc epitope protein and P-globin templates, 
respectively. Lanes 3-5 show the results of immunoprecipitation of the myc-epitope 

20 peptide using a c-myc monoclonal antibody and PBS, DB, and PBSTDS wash 

buffers, respectively. Lanes 6-8 show the same immunoprecipitation reactions, but 
using the P-globin translation product. 

FIGURE 1 1 is a photograph demonstrating immunoprecipitation of an 
RNA-protein fusion from an in vitro translation reaction. The picomoles of template 

25 used in the reaction are indicated. Lanes 1-4 show RNA 124 (the RNA portion of 
fusion LP 154), and lanes 5-7 show RNA-protein fusion LP 154. After 
immunoprecipitation using a c-myc monoclonal antibody and protein G sepharose, 
the samples were treated with RNase A and T4 polynucleotide kinase, then loaded on 
a denaturing urea polyacrylamide gel to visualize the fusion. In lanes 1-4, with 

30 samples containing either no template or only the RNA portion of the long myc 
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template (RNA124), no fusion was seen. In lanes 5-7. bands corresponding to the 
fusion were clearly visualized. Hxe position of 3^P labeled 30-P is indicated, and the 
amount of input template is indicated at the top of the figure. 

FIGURE 12 is a graph showing a quantitation of fusion material obtained 
5 from an in ^translation reaction. The intensity of the fiision bands shown in lanes 
5 VofFigurell and the 30-P band (isolated in a parallel fashion on dT„, not shown) 
were quantitated on phosphorimager plates and plotted as a function of input LP154 
concentration. Recovered modified 30-P (left y axis) was linearly proportional to 
input template (x axis), whereas linker-peptide fusion (right y axis) was constant. 
From this analysis, it was calculated that ~10« fusions were formed per ml of 
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translation reaction sample. 

HGURE 13 is a schematic representation of thiopropyl sepharose and dT,, 
agarose, and the abiUty of these substrates to interact with the RNA-protein fusions of 

the invention. , 

FIGURE 14 is a photograph showing the results of sequential isolation of 
fusions of the invention. Lane 1 contains -P labeled 30-P. Lanes 2 and 3 show 
LPl 54 isolated from translation reactions and treated with RNase A. In lane 2, LP154 
was isolated sequentially, using thiopropyl sepharose followed by dT,, agarose. Lane 
3 shows isolation using only dT,, agarose. The results indicated that the product 
20 contained a free tiuol. likely the penultimate cysteine in the myc epitope coding 
sequence. 

FIGURES 15Aand 15B are photographs showing the formation of fusion 
products using P-giobin templates as assayed by SDS-tricine-PAGE (polyacrylamide 
gel electrophoresis). Figure 15A shows incorporation of using either no template 
25 (lanel),asyn-P-globintemplate(lanes2-4),oranLP-P-globintemplate(lanes5-7). 

Figure 15B (lanes labeled as in Fig. 15A) shows -S-labeled material isolated by 
oligonucleotide affinity chromatography. No material was isolated in the absence of a 

30-P tail (lanes 2-4). 

FIGURES 16A-16C are diagrams and photographs Ulustrating enrichment 
of myc dsDNA versus pool dsDNA by in yito selection. Figure 16A is a schematic 
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of the selection protocol. Four mixtures of the myc and pool templates were 
translated in vitro and isolated on dTjs agarose followed by TP sepharose to purify the 
template fusions from unmodified templates. The mRNA-peptide fusions were then 
reverse transcribed to suppress any secondary or tertiary structure present in the 
5 templates. Aliquots of each mixture were removed both before (Figure 16B) and after 
(Figure 16C) affmity selection, amplified by PGR in the presence of a labeled primer, 
and digested with a restriction enzyme that cleaved only the myc DNA. The input 
mixtures of templates were pure myc (lane 1), or a 1 :20, 1 :200, or 1:2000 mycipool 
(lanes 2-4). The unselected material deviated from the input ratios due to preferential 

1 0 translation and reverse transcription of the myc template. The enrichment of the myc 
template during the selective step was calculated from the change in the poolrmyc 
ratio before and after selection. 

FIGURE 17 is a photograph illustrating the translation of myc RNA 
templates. The following linkers were used: lanes 1-4, dA27dCdCP; lanes 5-8, 

15 dA27rCrCP; and lanes 9-12, dA2,C9C9C9dAdCdCP. In each lane, the concentration of 
RNA template was 600 nM, and ^^S-Met was used for labeUng. Reaction conditions 
were as follows: lanes 1, 5, and 9, 30°C for 1 hour; lanes 2, 6, and 10, SO^'C for 2 
hours; lane 3, 7, and 1 1, 30° C for 1 hour, -20°C for 16 hours; and lanes 4, 8, and 12, 
30**C for 1 hour, -20''C for 16 hours with 50 mM Mg^*. In this Figure, "A" represents 

20 free peptide, and "B" represent mRNA-peptide fusion. 

FIGURE 18 is a photograph illustrating the translation of myc RNA 
templates labeled with The linker utilized was dA2,C9C9C9dAdCdCP. 
Translation was performed at 30°C for 90 minutes, and incubations were carried out 
at -20**C for 2 days without additional Mg^*. The concentrations of mRNA templates 

25 were 400 nM (lane 3), 200 nM (lane 4), 100 nM (lane 5), and 100 nM (lane 6). Lane 
1 shows mRNA-peptide fusion labeled with ^^S-Met. Lane 2 shows mRNA labeled 
with ^^P. In lane 6, the reaction was carried out in the presence of 0.5 mM cap analog. 

FIGURE 19 is a photograph illustrating the translation of myc RNA 
template using lysate obtained from Ambion (lane 1), Novagen (lane 2), and 

30 Amersham (lane 3). The linker utilized was dA27dCdCP. The concentration of the 
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template was 600 nM. and «S-Met was used for labeling. Translations were 
perfonned at 30-C for 1 hour, and incubations were carried out at -20»C overnight m 
the presence of 50 niM Mg^*. 

Described herein is a general method for the selection of proteins with 
5 desired functions using fusions in which these proteins are covalently linked to their 
own messenger RNAs. These RNA-protein fusions are synthesized by in Yltia or m 
Situ translation of mRNA pools containing a peptide acceptor attached to the^r 3' ends 
(Figure IB). In one preferred embodiment, after readthroughofthe open reading 
frame of the message, the ribosome pauses when it reaches the designed pause site, 
and the acceptor moiety occupies the ribosomal A site and accepts the nascent pepUde 
chain from the peptidyl-tRNA in the P site to generate the RNA-protein fusion 
(Figure IC). The covalent link between the protein and the RNA (in the fomi of an 
amide bond between the 3' end of the mRNA and the Ctermmus of the protein which 
it encodes) allows the genetic infomiation in the protein to be recovered and amplified 
(eg by PGR) following selection by reverse transcription of the RNA. Once the 
fiision is generated, selection or enrichment is canied out based on the properties of 
the mRNA-protem fusion, or. alternatively, reverse transcription may be earned out 
using the mRNA template while it is attached to the protem to avoid any effect of the 
single-stranded RNA on the selection. When the mRNA-protein construct is used, 
selected fusions may be tested to determine which moiety (the protein, the RNA, or 
both) provides the desired function. 

In one preferred embodiment, puromycin (which resembles tyrosyl 
adenosine) acts as the acceptor to attach the gnawing peptide to its mRNA. 
Puromycin is an antibiotic that acts by tenninating peptide elongation. As a mimetic 
of aminoacyl-tRNA, it acts as a universal inhibitor of protein synthesis by binding the 
A site, accepting the growing peptide chain, and falling off the ribosome (at a Kd = 
10-* M) (Traut and Monro, J. Mol. Biol. 10:63 (1964); Smith et al., 3. Mol. Biol. 
13-617 (1965)). One of the most attractive features of puromycin is the fact that it 
forms a stable amide bond to the growing peptide chain, thus allowing for more stable 
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fusions than potential acceptors that fonn unstable ester linkages. In particular, the 
peptidyl-puromycin molecule contains a stable amide linkage between the peptide and 
the O-methyl tyrosine portion of the puromycin. The O-methyl tyrosine is in turn 
linked by a stable amide bond to the 3'-amino group of the modified adenosine 

5 portion of puromycin. 

Other possible choices for acceptors include tRNA-like structures at the 3* 
end of the mRNA, as well as other compounds that act in a manner similar to 
puromycin. Such compounds include, without limitation, any compound which 
possesses an amino acid linked to an adenine or an adenine-like compound, such as 

10 the amino acid nucleotides, phenylalanyl-adenosine (A-Phe), tyrosyl adenosine (A- 
Tyr), and alanyl adenosine (A- Ala), as well as amide-linked structures, such as 
phenylalanyl 3' deoxy 3* amino adenosine, alanyl 3* deoxy 3* amino adenosine, and 
tyrosyl 3* deoxy 3' amino adenosine; in any of these compounds, any of the naturally- 
occurring L-amino acids or their analogs may be utilized. In addition, a combined 

1 5 tRNA-like 3' structure-puromycin conjugate may also be used in the invention. 

Shown in Figure 2 is a preferred selection scheme according to the 
invention. The steps involved in this selection are generally carried out as follows. 

Step 1 . Preparation of the DNA template. As a step toward generating 
the RNA-protein fusions of the invention, the RNA portion of the fusion is 

20 synthesized. This may be accomplished by direct chemical RNA synthesis or, more 
commonly, is accomplished by transcribing an appropriate double-stranded DNA 
template. 

Such DNA templates may be created by any standard technique (including 
any technique of recombinant DNA technology, chemical synthesis, or both). In 

25 principle, any method that allows production of one or more templates containing a 
known, random, randomized, or mutagenized sequence may be used for this purpose. 
In one particular approach, an oligonucleotide (for example, containing random bases) 
is synthesized and is amplified (for example, by PGR) prior to transcription. 
Chemical synthesis may also be used to produce a random cassette which is then 

30 inserted into the middle of a known protein coding sequence (see, for example. 
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RNA-protein fusion may be chemically synthesized using standard techniques of 
oligonucleotide synthesis. Alternatively, and particularly if longer RNA sequences 
are utilized, the RNA portion is generated by in vitro transcription of a DNA template. 
In one preferred approach, T7 polymerase is used to enzymatically generate the RNA 

5 strand. Other appropriate RNA polymerases for this use include, without limitation, 
the SP6, T3 and K fiOli RNA polymerases (described, for example, in Ausubel et al. 
(supra, chapter 3). In addition, the synthesized RNA may be, in whole or in part, 
modified RNA. In one particular example, phosphorothioate RNA may be produced 
(for example, by T7 transcription) using niodified ribonucleotides and standard 

10 techniques. Such modified RNA provides the advantage of being nuclease stable. 

Step 3. Ligation of Puromvcin to the Template. Next, puromycin (or any 
other appropriate peptide acceptor) is covalently bonded to the template sequence. 
This step may be accomplished using T4 RNA ligase to attach the puromycin directly 
to the RNA sequence, or preferably the puromycin may be attached by way of a DNA 

15 "splint" using T4 DNA ligase or any other enzyme which is capable of joining 

together two nucleotide sequences (see Figure IB) (see also, for example, Ausubel et 
al., supra, chapter 3, sections 14 and 15). tRNA synthetases may also be used to 
attach puromycin-like compounds to RNA. For example, phenylalanyl tRNA 
synthetase links phenylalanine to phenylalanyl-tRNA molecules containing a 3' amino 

20 group, generating RNA molecules with puromycin-like 3* ends (Fraser and Rich, 

Proc. Natl. Acad. Sci. USA 70:2671 (1973)). Other peptide acceptors which may be 
used include, without limitation, any compound-which possesses an amino acid linked 
to an adenine or an adenine-like compound, such as the amino acid nucleotides, 
phenylalanyl-adenosine (A-Phe), tyrosyl adenosine (A-Tyr), and alanyl adenosine (A- 

25 Ala), as well as amide-linked structures, such as phenylalanyl 3' deoxy 3* amino 
adenosine, alanyl 3* deoxy 3* amino adenosine, and tyrosyl 3' deoxy 3* amino 
adenosine; in any of these compounds, any of the naturally-occurring L-amino acids 
or their analogs may be utilized. A number of peptide acceptors are described, for 
example, in Krayevsky and Kukhanova, Progress in Nucleic Acids Research and 

30 Molecular Biology 23:1 (1979). 
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c.„r ^ r..n^tinn n n i P— A-Protem Fw^ions. To generate 
RN A-protein fusions, any in sittQ or in sto translation system may be utilized. As 
shown below, eukaryotic systems are preferred, and two particularly preferred 
systems include the wheat germ and reticulocyte lysate systems. In principle. 
5 however, any translation system which allows formation of an RNA-protein fusion 
and which does not significantly degrade ti.e RNA portion of the fusion is useful m 
the invention. In addition, to reduce RNA degradation in any of these systems, 
degradation-blocking antisense oligonucleotides may be included in the teanslation 
reaction mixtu«; such oUgonucleotides specifically hybridize to and cover sequences 
10 witirin tfie RNA portion of tiie molecule that trigger degradation (see, for example. 
Hanes and Pluckttxun, Proc. Nati. Acad. Sci USA 94:4937 (1997)). 

As noted above, any number of eukaryotic translation systems are 
available for use in tiie invention. These include, without limitation, lysates from 
yeast, ascites, tumor ceUs (Leibowitz et al., Meth. Enzymol. 194:536 (1991)), and 
15 xenopus oocyte eggs. Useful in yitta translation systems from bacterial systems 
include, without limitation, those described in Zubay (Ann. Rev. Genet. 7:267 
(1973)); Chen and Zubay (Meth. Enzymol. 101:44 (1983)); and EUman (Meth. 

Enzymol. 202:301 (1991)). 

m addition, translation reactions may be carried out in sim. In one 
20 particular example, translation may be carried out by injecting mRN A into Xenopus 
eggs using standard techniques. 

Once generated, RNA-protein fusions may be recovered from tiie 
translation reaction mixture by any standard technique of protein or RNA purification. 
Typically, protein purification techniques are utilized. As shown below, for example. 
25 purification of a fiision may be facilitated by tiie use of suitable chromatographic 
reagents such as dT,, agarose or tiiiopropyl sepharose. Purification, however, may 
also or alternatively involve purification based upon the RNA portion of the fusion; 
techniques for such purification are described, for example in Ausubel et al. (ssm. 
chapter 4). 

30 < ^^ ^ < g^io^-tinn a m - P^ir»^ TtNA-Protein Fusion, Selection of a 
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desired RNA-protein fusion may be accomplished by any means available to 
selectively partition or isolate a desired fusion from a population of candidate fusions. 
Examples of isolation techniques include, without limitation, selective binding, for 
example, to a binding partner which is directly or indirectly immobilized on a column, 

5 bead, membrane, or other solid support, and immimoprecipitation using an antibody 
specific for the protein moiety of the fusion. The first of these techniques makes use 
of an immobilized selection motif which can consist of any type of molecule to which 
binding is possible. A list of possible selection motif molecules is presented in Figure 
2. Selection may also be based upon the use of substrate molecules attached to an 

10 affinity label (for example, substrate-biotin) which react with a candidate molecule, or 
upon any other type of interaction with a fusion molecule. In addition, proteins may 
be selected based upon their catalytic activity in a manner analogous to that described 
by Bartel and Szostak for the isolation of RNA enzymes (supraV according to that 
particular technique, desired molecules are selected based upon their ability to link a 

1 5 target molecule to themselves, and the functional molecules are then isolated based 
upon the presence of that target. Selection schemes for isolating novel or improved 
catalytic proteins using this same approach or any other functional selection are 
enabled by the present invention. 

In addition, as described herein, selection of a desired RNA-protein fusion 

20 (or its DNA copy) may be facilitated by enrichment for that fusion in a pool of 
candidate molecules. To carry out such an optional enrichment, a population of 
candidate RNA-protein fusions is contacted with a binding partner (for example, one 
of the binding partners described above) which is specific for either the RNA portion 
or the protein portion of the fusion, under conditions which substantially separate the 

25 binding partner-fusion complex from unbound members in the sample. This step may 
be repeated, and the technique preferably includes at least two sequential enrichment 
steps, one in which the fusions are selected using a binding partner specific for the 
RNA portion and another in which the fusions are selected using a binding partner 
specific for the protein portion. In addition, if enrichment steps targeting the same 

30 portion of the fusion (for example, the protein portion) are repeated, different binding 
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partners are preferably utilized. In one particular example described herein, a 
population of molecules is enriched for desired fusions by first using a binding partner 
specific for the RNA portion of the fusion and then, in two sequential steps, using two 
different binding partners, both of which are specific for the protein portion of the 
5 fiision. Again, these complexes may be separated fix>m sample components by any 
standard separation technique including, without limitation, column affimty 
chromatography, centrifugation, or immunoprecipitation. 

Moreover, elution of an RNA-protein fusion from an enrichment (or 
selection) complex may be accomplished by a number of approaches. For example, 
1 0 as described herein, one may utilize a denaturing or non-specific chemical elution step 
to isolate a desired RNA-protein fiision. Such a step facilitates the release of complex 
components from each other or from an associated solid support in a relatively non- 
specific mamxer by breaking non-covalent bonds between the components and/or 
between the components and the solid support. As described herein, one exemplary 
15 denaturingornon-specificchemicalelutionreagentis4o/oHOAc/HA Other 

exemplary denaturing or non-specific chemical elution reagents include guamdme, 
urea high salt, detergent, or any other means by which non-covalent adducts may 
generally be removed. Alternatively, one may utilize a specific chemical elutxcn 
approach, in which a chemical is exploited that causes the specific release of a fusion 
20 molecule. l„ one particular example, ifthe linker arm of a desired fiisionprotem 
contains one or more disulfide bonds, bound fiision aptamers may be eluted by the 
addition, for example, of DTT, resulting in the reduction of the disulfide bond and 

release of the bound target. 

Alternatively, elution may be accomplished by specifically disrupting 
25 affinity complexes; such techniques selectively release complex components by the 
addition of an excess of one member of the complex. For example, in an ATP- 
binding selection, elution is performed by the addition of excess ATP to the 
incubation mixture. Finally, one may carry out a step of enzymatic elution. Bythis 
approach, a bound molecule itself or an exogenously added protease (or other 
30 appropriate hydrolytic enzyme) cleaves and releases either the target or the enzyme. 
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In one particular example, a protease target site may be included in either of the 
complex components, and the bound molecules elutcd by addition of the protease. 
Alternately, in a catalytic selection, elution may be used as a selection step for 
isolating molecules capable of releasing (for example, cleaving) themselves from a 
5 solid support. 

Step 6. Gen eration of a DNA Conv of the RNA Sequence using Reverse 
Transcriptase. If desired, a DNA copy of a selected RNA fusion sequence is readily 
available by reverse transcribing that RNA sequence using any standard technique 
(for example, using Superscript reverse transcriptase). This step may be carried out 
10 prior to the selection or enrichment step (for example, as described in Figure 16), or 
following that step. Alternatively, the reverse transcription process may be carried 
out prior to the isolation of the fusion from the in vitro or in situ translation mixture. 

Next, the DNA template is amplified, either as a partial or full-length 
double-stranded sequence. Preferably, in this step, full-length DNA templates are 
1 5 generated, using appropriate oligonucleotides and PCR amplification. 

These steps, and the reagents and techniques for carrying out these steps, 
are now described in detail using particular examples. These examples are provided 
for the purpose of illustrating the invention, and should not be construed as limiting. 

GENERATION OF TEMPLATES FOR RN A-PROTEIN FUSIONS 
20 As shown in Figures 1 A and 2, the selection scheme of the present 

invention preferably makes use of double-stranded DNA templates which include a 
number of design elements. The first of these elements is a promoter to be used in 
conjunction with a desired RNA polymerase for mRNA synthesis. As shown in 
Figure 1 A and described herein, the T7 promoter is preferred, although any promoter 
25 capable of directing synthesis from a linear double-stranded DNA may be used. 

The second element of the template shown in Figure 1 A is termed the 5' 
untranslated region (or 5'UTR) and corresponds to the RNA upstream of the 
translation start site. Shown in Figure 1 A is a preferred SUTR (termed "TE") which 
is a deletion mutant of the Tobacco Mosaic Virus 5' untranslated region and, in 
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j . u„c»oHir«-tlv 5' oftheTMV translation start; the 
particular, corresponds to the bases directly 5 oi tne i , „ tt aw- 

« ^w-w-j rArCrA rArUrU rArCrU rArUrU rUrArC 
sequence of this UTR is as follows: rGiGrG rAiCrA rAruru 

rArArU rUrArC rA (with the first 3 G nucleotides being inserted to augment 
transcription)(SEQIDNO:5).A.yotherappropriate5.I^xnaybeutilized(see. 

5 for example, Kozak. Microbiol. Rev. 47: 1 (1983)). 

The third element shown in Figure 1 A is the translation start s,te. In 
general, Ms is an AUG codon. However, there are examples where codons other than 
AUG are utilized innaturally-occurring coding sequences, and these codons may also 

be used in the selection scheme of the invention. 
10 The fourth element in Figure i A is the open reading frame of the protem 

(termed ORF), which encodes the protein sequence. This open reading frame may 
encode any naturally-occurring, random, randomized, mutagenized. or totally 

synthetic protein sequence. 

The fldh dem-n* shown in Figure 1 A is the 3' constant region. Th,s 
,5 sequence faciUutesPCRampb8c«ionofa.e pool sequences and lig-ion of the 

puro^ycin— ngoUgonue.eo.idetothe.KKA. if desired, this regronn^yaiso 

,„c,ude a pause site, a sequence which causes the riboson,e to pause and the^y 
..iowsaddUional time for an acceptor n,oie„(*.rexan^le.P«ron,ycin) to aecepta 

nascent peptide chain fiom the peptidyi-tRNA; this pause site is diseased .nn^re 

20 deuil below^^ ^^^^ ^ ^ methodology. RNA-p».ein fusions were initially 

generated using highly simplified mBNA ten^hues c«>taimng 1-2 codons^ Th. 
approachwastalcenf^tworeasona. First. t^nplatesofthissi^couldrcaddy be 

„^e by chemical synd«si. And. second, a sm^l open »«Ung ft«ne allowed 
25 critical features of the reaction, including efficiency of linkage, end heterogeu^ty. 
.emplate dependence, and accuracy of translation, to be readily assayed. 

Liga^fC^. Ahasicconst^ctwasusedforgeneratingtestBKA- 
p^li^ons. Themoleouleconsistedot.mRKAcontainingaShine-Dalg^^^ 
SD)sequeneefortra.slaUoniniti«io„whichcon.ai„eda3basedeleU».of,heSD 
30 Je„ceftomribosom,lp.oteinLlandwhichwascomp.ementaryto5bas.sofl6S 
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rRNA (i.e., rGrGrA rGrGrA rCrGrA rA) (SEQ ID NO: 6) (Stormo et al., Nucleic 
Acids Research 10:2971-2996 (1982); Shine and Dalgamo, Proc. Natl. Acad. Sci. 
USA 71:1342-1346 (1974); and Steitz and Jakes, Proc. Natl. Acad. Sci. USA 
72:4734-4738 (1975)), (ii) an AUG start codon, (iii) a DNA linker to act as a pause 

5 site (i.e., 5'-(dA)27), (iv) dCdC-3', and (v) a 3* puromycin (P). The poly dA sequence 
was chosen because it was known to template tRNA poorly in the A site (Morgan et 
aL, J. Mol. Biol. 26:477-497 (1967); Ricker and Kaji, Nucleic Acid Research 
19:6573-6578 (1991)) and was designed to act as a good pause site. The length of the 
oHgo dA linker was chosen to span the -60-70 A distance between the decoding site 

1 0 and the peptidyl transfer center of the ribosome. The dCdCP mimicked the CCA end 
of a tRNA and was designed to facilitate binding of the puromycin to the A site of the 
ribosome. : 

rhemical Synthesis of Min imal Template 43-P. To synthesize construct 
43-P (shown in Figure 3), puromycin was first attached to a solid support in such a 

1 5 way that it would be compatible with standard phosphoramidite oligonucleotide 

synthesis chemistry. The synthesis protocol for this oligo is outlined schematically in 
Figure 3 and is described in more detail below. To attach puromycin to a controlled 
pore glass (CPG) solid support, the amino group was protected with a trifluoroacetyl 
group as described in Applied Biosystems User Bulletin #49 for DNA synthesizer 

20 model 380 (1 988). Next, protection of the 5' OH was carried out using a standard 

DMT-Cl approach (Gait, Oligonucleotide Synthesis a practical approachThe Practical 
Approach Series (IRL Press, Oxford, 1984)), and attachment to aminohexyl CPG 
through the 2' OH was effected in exactly the same fashion as the 3' OH would be 
used for attachment of a deoxynucleoside (see Fig. 3 and Gait, supra, p. 47). The 5' 

25 DMT-CPG-linked protected puromycin was then suitable for chain extension with 
phosphoramidite monomers. The synthesis of the oligo proceeded in the 3' -> 5* 
direction in the order: (i) 3* puromycin, (ii) pdCpdC, (iii) -27 units of dA as a linker, 
(iv) AUG, and (v) the Shine-Dalgamo sequence. The sequence of the 43-P construct 
is shown below. 

30 Synthesis of CPG Puromvcin . The synthesis of protected CPG puromycin 
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followed .he genera, pa*u3.d.^deoxyn»de»ide..spr«vious,,ou.««d («^^^^ 

tolioweaiii K Th. Practical AopioiichSenes (IRL 

01igo„uoleotideSynthesis,APract,calApp«.ach,ThePiac«cal pp 

IL oxford, 1984,). Major dep».ures included selecUon of an appropnaleN 

5 rrsolluppon. ,nd,ecase.fd.la.»,««re^on„aacarHedo„.a.ve„lo« 
:::„Li„n::fac.iva«dn»c,«.d.,s,.is.„a.crta.«aasi.^ 

d,an *e solid support. TTe resuWng yield (-20 ^moVg support) w,s <,»,.e 
sa.isfac.oryconsidmng.l.edil».ere«»ionconditiom. 

Svnttosia.mTrinil(m.o-1vl^<™-'. 267 mg (0.490 uunol) 
,0 P„romycin.HC.wasfirs.co„ver.ed«.a.ef^ba,efonnbydiss„.vi«gin-«r 

! IgpH 11 ca,bona.ebu«er.->dexUac,i.g<3X,i.»ehlor„«- Theo^iC 
rievap„ra.ed.«d.ynessa„d„.gh.(242n.g.0.513.™ol,^T^^^^^^^ 
Ld,end.ssolvedinll.nld,ypynaineand,,,nld,yace»n,U..e.a„dl39.(^0 

„„ol) ■r.e.hylamine (TEA, a,^ .39 0-0 nunO of -HHuoroaoeUc «*>^de 
,3 ;;^„ereaddedwi,Ks«ng. TTAA «as men add«. » «.e sol,«.on ,„ 20 
ZZ-„.i.noneofd»s..«ingn.a.eHa,re„»incd..sass,,.dl,yd,in..yer 

ltIograp.y,*)<93:^»or„^eO>0,a.o..of 280.0. Tbe.^^^^^^^ 
allowedto proceed forone hour. A.«n.P<.in.^.obands»erereve^ed y» 

:La.„g:phy,.o.hofMghe..no.i.i.d«-3«ingn.a»a.^W^^^ 
20 reac.ionwi,hNH,OHand»a«rreducediheproduc..oa.mg.eband. S lea 

hv ,93-7 ChloroformMeOH) yielded 293 mg (0.515 mmol) of flre 

rN'^trCroduc.ord.sreac.on.shown^^^^ 

ahovereac.ion„asali<,uo.ed-coevapora.».2Xwid.d.ypyHdineU.ren,~^ 
.5 Mul,ip,e.ubeswereprep.redu..ea.mul.iplere«Uo„condi.o,«. In a snrall^ale 
" :!:n,27.4.g(4S.2.n„les)H.^A-Purweredi3»lvedin«0p.opy^^^^^ 

conlaimng 0.05 e, of DMAP and 1 .4 eq TEA. To Una nnxh-re. 20.6 n.g of .nty 
contmm g M ^ ,^ completion 

chloride (60 pmol, was added, and the reaction w 

withstirring. The reaction was stopped by addition of annual volume of wa.« 
30 Cr— y500l.,)to.hesolution. Because thU reaCon appeared successM , a 
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large scale version was performed. In particular, 262 mg (0.467 mmol) N-TFA-Pur 
was dissolved in 2.4 ml pyridine followed by addition of 1.4 eq of TEA, 0.05 eq of 
DMAP, and 1.2 eq of trityl chloride. After approximately two hours, an additional 50 
mg (0.3 eq) dimethoxytrityPCl (DMT*C1) was added, and the reaction was allowed 

5 to proceed for 20 additional minutes. The reaction was stopped by the addition of 3 
ml of water and coevaporated 3X with CH3CN. The reaction was purified by 95 :5 
Chloroform/MeOH on a 100 ml silica (dry) 2 mm diameter column. Due to 
incomplete purification, a second identical column was run with 97.5:2.5 
Chloroform/MeOH. The total yield was 325 mg or 0.373 nmiol (or a yield of 72%). 

10 The product of this reaction is shown schematically in Figure 4. 

Synthesis of N-TrifluoroacetvL 5'-DM T, T Succinvl Puromvcin. In a 
small scale reaction, 32 mg (37 ^mol) of the product synthesized above was combined 
with 1.2 eq of DMAP dissolved in 350 \k\ of pyridine. To this solution, 1.2 
equivalents of succinic anhydride was added in 44 jil of dry CH3CN and allowed to 

1 5 stir ovemight. Thin layer chromatography revealed little of the starting material 

remaining. In a large scale reaction, 292 mg (336 jimol) of the previous product was 
combined with 1 .2 eq DMAP in 3 ml of pyridine. To this, 403 \i\ of IM succinic 
anhydride in dry CH3CN was added, and the mixture was allowed to stir ovemight. 
Thin layer chromatography again revealed little of the starting material remaining. 

20 The two reactions were combined, and an additional 0.2 eq of DMAP and succinate 
were added. The product was coevaporated with toluene IX and dried to a yellow 
foam in high vacuum. CHjClj was added (20 ml), and this solution was extracted 
twice with 15 ml of 10% ice cold citric acid and then twice with pure water. The 
product was dried, redissolved in 2 ml of CHjClj, and precipitated by addition of 50 

25 ml of hexane with stirring. The product was then vortexed and centrifuged at 600 rpm 
for 10 minutes in the clinical centrifuge. The majority of the elucnt was drawn off, 
and the rest of the product was dried, first at low vacuum, then at high vacuum in a 
dessicator. The yield of this reaction was approximately 260 ^mol for a stepwise 
yield of'«70%. 
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L 2 hours The insoluble oyelohexy. «.e. p^-duced by .he re«:Bon was 

' — o. 

The resin was then washed wiO. DMF, meO^not and eflrer. and dned The resul, g 
riassayed..con*ing22..p.no>o«.ylpers,wel.wi,hind,eaccep^^^^^^ 

■ ,„heredoligomayben«dtoeilhorottwow.yslogeneraBa 
puronrycnredre^dohgo J ,^„„^„«e. For extremely 

„RNA-pnromyc« eon,„ga.e whreh »=.s as ^.^ 
1, ^ ™™ peadine frames, ae puromycin ohgo is typicauy e». 
TaIZ — .crea«a.uny.,n«»«c.«npla.e. When lo„g» open 
,„ rrrlsaredesired,d.eR>.A„rDNAo,igoisgenera.l,Ug«ed.od,e.e^or 
:t^Ausi„gaONAsplin.andT4DNA«gaseasdeseHbedbyMo»ea™lSharp 

(Science 256:992 (1992)). 

1 1 1 n^ilTTT n TTT ^>T^T /'""^ 
j^oTT>^^T PNA-PPOTFTN FTT8T0NS 

andeukaryoticinyitmtranslationsystemsasfollows. 

^.v^ere added to several different ia jdlEfi translation 
RNA-puromycm conjugates were added Q^^^^ay.Ann. 
systems including: (i) the S30 system denved from 1. eoll MRb 
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Rev. Genet. 7:267 (1973); Collins, Gene 6:29 (1979); Chen and Zubay, Methods 
Enzymol, 101 :44 (1983); Pratt, in Transcription and Translation: A Practical 
Approach, B. D. Hammes, S. J. Higgins, Eds. (IRL Press, Oxford, 1984) pp. 
179-209; and Ellman et al.. Methods Enzymol. 202:301 (1991)) prepared as described 
5 by Ellman et. al. (Methods Enzymol. 202:301 (1991)); (ii) the ribosomal fraction 
derived from the same strain, prepared as described by Kudlicki et al. (Anal. Chem. 
206:389 (1992)); and (iii) the S30 system derived from cqH BL21, prepared as 
described by Lesley et al. (J. Biol. Chem. 266:2632 (1991)). In each case, the premix 
used was that of Lesley et al. (J. Biol. Chem. 266:2632 (1991)), and the incubations 

1 0 were 30 minutes in duration. 

Testing the Nature of the Fusion . The 43-P template was first tested using 
S30 translation extracts from E^ soli- Figure 5 (Reaction "A") demonstrates the 
desired intramolecular (cis) reaction wherein 43-P binds the ribosome and acts as a 
template for and an acceptor of fMet at the same time. The incorporation of 

1 5 ^^S-methionine and its position in the template was first tested, and the results are 
shown in Figures 6A and 6B. After extraction of the in vitro translation reaction 
mixture with phenol/chloroform and analysis of the products by SDS-PAGE, an ^^S 
labeled band appeared with the same mobility as the 43-P template. The amount of 
this material synthesized was dependent upon the Mg^^ concentration (Figure 6A). 

20 The optimum Mg^" concentration appeared to be between 9 and 1 8 mM, which was 
similar to the optimum for translation in this system (Zubay, Ann. Rev. Genet. 7:267 
(1973); Collins, Gene 6:29 (1979); Chen and Zubay, Methods Enzymol, 101:44 
(1983); Pratt, in Transcription and Translation: A Practical Approach, B. D. 
Hammes, S. J. Higgins, Eds. (IRL Press, Oxford, 1984) pp. 179-209; Ellman et al., 

25 Methods Enzymol. 202:301 (1991); Kudlicki et al.. Anal. Chem. 206:389 (1992); and 
Lesley et al., J. Biol. Chem. 266:2632 (1991)). Furthermore, the incorporated label 
was stable to treatment with NH4OH (Figure 6B), indicating that the label was located 
on the 3' half of the molecule (the base-stable DNA portion) and was attached by a 
base-stable linkage, as expected for an amide bond between puromycin and fMct. 

30 Ribosome and Template Dependence . To demonstrate that the reaction 
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observed above occurred on the ribosome. the effects of specific inhibitors of the 
peptidyl transferase function of the ribosome were tested (Figure 6C). and the effect 
of changing the sequence coding for methionine was examined (Figure 6D). Figure 
6C demonstr^es clearly that the reaction was strongly inhibited by the peptidyl 
5 transferase inhibitors, virginiamycin, gougerotin, and chloramphenicol (Monro and 
Vazquez. J. Mol. Biol. 28:161-165 (1967); and Vazquez and Monro, Biochemica et 
Biophysical Acta 142:155-173 (1967)). Figure 6D demonstrates that changing a 
single base in the template from A to C abolished incorporation of -S methionine at 9 
mM Ms-, and greatly decreased it at 1 8 mM (consistent with the fact that high levels 
10 ofMg- allow misreading of the message). These experiments demonstrated that the 
reaction occurred on the ribosome in a template dependent fashion. 

T inker T^eth. Also tested was the dependence ofthe reaction on the 
length ofthe linker (Figure 6E). The original template was designed so that the linker 
spanned the distance from the decoding site (occupied by the AUG ofthe template) to 
1 5 the acceptor site (occupied by the puromycin moiety), a distance which was 

approximately the same length as the distance between the anticodon loop and the 
acceptor stem in a tRNA. or about 60-70 A. The first linker tested was 30 nucleotides 
in length, based upon a minimum of 3.4 A per base (. 1 02 A). In the range between 
30 and 21 nucleotides (n = 27 - 18; length . 102 - 71 A), little change was seen m the 
20 efficiency ofthe reaction. Accordingly, linker length may be varied. Whrle a linker 
of between 21 and 30 nucleotides represents a prefenred length, linkers shorter than 80 
nucleotides and. preferably, shorter than 45 nucleotides may also be utilized m the 
invention. 

T^^ ^ ^ ^^„l.r V. Tr ^^r^»i.>>n1.r Reactions. Finally, we tested whether 
25 the reaction occurred in an intramolecular fashion (Figure 5, Reaction "A") as desired 
or intermolecularly (Figure 5, Reaction "B")- This was tested by adding 
oligonucleotides with 3' puromycin but no ribosome binding sequence (i.e.. templates 
25-P, 13-P, and 30-P) to the translation reactions containing the 43-P template 
(Figilres 6F. 6G. and 6H). If the reaction occurred by an intermolecular mechanism. 
30 the shorter oligos would also be labeled. As demonstrated in Figures 6F-H. there was 
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little incorporation of ^^S methionine in the three shorter oligos, indicating that the 
reaction occurred primarily in an intramolecular fashion. The sequences of 25-P 
(SEQ ID NO: 10), 13-P (SEQ ID NO: 9), and 30-P (SEQ ID NO: 8) are shown below. 

RetiGulocvte Lysate . Figure 6H demonstrates that ^^S-methionine may be 
5 incorporated in the 43-P template using a rabbit reticulocyte lysate (see below) for in 
vitro translation, in addition to the coli lysates used above. This reaction occurred 
primarily in an intramolecular mechanism, as desired. 

SYNTHESIS AND TESTING OF FUSIONS 
CONTAINING A C-MYC EPITOPE TAG 

1 0 Exemplary fusions were also generated which contained, within the 

protein portion, the epitope tag for the c-myc monoclonal antibody 9E10 (Evan et al., 
Mol. Cell Biol. 5:3610 (1985)). 

Design of Templates . Three initial epitope tag templates (i.e., LP77, 
LPl 54, and Pool #1 ) were designed and are shown in Figures 7A-C. The first two 

1 5 templates contained the c-myc epitope tag sequence EQKLISEEDL (SEQ ID NO: 2), 
and the third template was the design used in the synthesis of a random selection pool. 
LP77 encoded a 12 amino acid sequence, with the codons optimized for bacterial 
translation. LPl 54 and its derivatives contained a 33 amino acid mRNA sequence in 
which the codons were optimized for eukaryotic translation. The encoded amino acid 

20 sequence of MAEEQKLISEEDLLRKRREQKLKHKLEQLRNSCA (SEQ ID NO: 7) 
corresponded to the original peptide used to isolate the 9E10 antibody. Pool#l 
contained 27 codons of NNG/C (to generate random peptides) followed by a sequence 
corresponding to the last seven amino acids of the myc peptide (which were not part 
of the myc epitope sequence). These sequences are shown below. 

25 Reticulocyte vs. Wheat Germ In Vitro Transla tion Svstems. The 43-P, 

LP77, and LP 154 templates were tested in both rabbit reticulocyte and wheat germ 
extract (Promega, Boehringer Mannheim) translation systems (Figure 8). 
Translations were performed at 30*'C for 60 minutes. Templates were isolated using 
dTjs agarose at 4**C. Templates were eluted fi-om the agarose using 15 mM NaOH, 1 



PCT/US98/00807 

• WO9W31700 

-34- 

mU EDTA. neutralized with NaOAcmOAc buffer, immediately ethanol precipitated 
(25-3 vol) washed (with 100% ethanol). and dried on a speedvac concentrator. 
Figure 8 shows that «S methionine was incorporated into all three templates, in both 
the wheat germ and reticulocyte systems. Less degradation of the template was 
5 observed in the fusion reactions from the reticulocyte system and. accordmgly. th,s 
system is preferred for the generation of RNA-protein fusions. In addition, m general, 
eukaryotic systems are preferred over bacterial systems. Because eukaryotic cells 
tend to contain lower levels of nucleases. mRNA lifetimes are generally 10-100 times 
longer in these cells than in bacterial cells. In experiments using one particular E. Sfill 
10 translation system, generation of fosions was not observed using a template encodmg 
the c-myc epitope; labeling the template in various places demonstrated that this was 
likely due to degradation of both the RNA and DNA portions of the template. 

To examine the peptide portion of these fusions, samples were treated with 
RNase to remove the coding sequences. Following this treatment, the 43-P product 
15 ran with almost identical mobility to the labeled 30-P oligo. consistent with a very 
small peptide (perhaps only methionine) added to 30-P. For LP77. removal of the 
coding sequence produced a product with lower mobility than the 30-P oligo. 
consistent with the notion that a 12 amino acid peptide was added to the puromycm. 
Finally for LP154, removal of the coding sequence produced aproduct of yet lower 
20 mobility, consistent with a 33 amino acid sequence attached to the 30-P oligo. No 

oligo was seen in the RNase-treated LP154 reticulocyte lane due to a loading error. In 
Figure 9. the mobility of this product was shown-to be the same as the product 
generated in the wheat germ extract. In sum, theseresults indicated that RNase 
resistant products were added to the ends of the 30-P oligos. that the sizes of the 
25 products were proportional to the length of the coding sequences, and that the 
products were quite homogeneous in size. In addition, although both systems 
produced similar fusion products, the reticulocyte system appeared supenor due to 

higher template stability. 

c.^.;.,,| ^^.PM...AandProtein^ QK. In Figure 9. sensUivity to RNase 

30 A and proteinase K were tested using the LP154 fiision. As shown in lanes 2-4, 
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incorporation of ^'S methionine was demonstrated for the LP 154 template. When this 
product was treated with RNase A, the mobility of the fusion decreased, but was still 
significantly higher than the ^^P labeled 30-P oligonucleotide, consistent with the 
addition of a 33 amino acid peptide to the 3* end. When this material was also treated 
5 with proteinase K, the ^^S signal completely disappeared, again consistent with the 
notion that the label was present in a peptide at the 3' end of the 30-P fragment. 
Similar results have been obtained in equivalent experiments using the 43-P and LP77 
fusions. 

To confirm that the template labeling by ^^S Met was a consequence of 

1 0 translation, and more specifically resulted firom the peptidyl transferase activity of the 
ribosome, the effect of various inhibitors on the labeling reaction was examined. The 
specific inhibitors of eukaryotic peptidyl transferase, anisomycin, gougerotin, and 
sparsomycin (Vazquez, Inhibitors of Protein Biosynthesis (Springer-Verlag, New 
York), pp. 312 (1979)), as well as the translocation inhibitors cycloheximide and 

1 5 emetine (Vazquez, Inhibitors of Protein Biosynthesis (Springer-Verlag, New York), 
pp. 312 (1979)) all decreased RNA-peptide fusion formation by -95% using the long 
myc template and a reticulocyte lysate translation extract, 

Immunoorecipitation Experiments . In an experiment designed to illustrate 
the efficacy of immunoprecipitating an mRNA-peptide fusion, an attempt was made 

20 to immunoprecipitate a free c-myc peptide generated by in vjtrq translation. Figure 
10 shows the results of these experiments assayed on an SDS PAGE peptide gel. 
Lanes I and 2 show the labeled material fix>m translation reactions containing either 
RNA124 (the RNA portion of LP154) or p-globin mRNA. Lanes 3-8 show the 
immunoprecipitation of these reaction samples using the c-myc monoclonal antibody 

25 9E1 0, under several different buffer conditions (described below). Lanes 3-5 show 
that the peptide derived from RNA 124 was effectively immunoprecipitated, with the 
best case being lane 4 where -83% of the total TCA precipitable counts were isolated. 
Lanes 6-8 show little of the P-globin protein, indicating a purification of > 100 fold. 
These results indicated that the peptide coded for by RNA124 (and by LP 154) can be 

30 quantitatively isolated by this immunoprecipitation protocol. 
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translated LP 154 template product, we examined the behavior of this product on two 
different types of chromatography media, Thiopropyl (TP) sepharose allows the 
isolation of a product containing a free cysteine (for example, the LP 154 product 
which has a cysteine residue adjacent to the C terminus) (Figure 13). Similarly, dTjj 
5 agarose allows the isolation of templates containing a poly dA sequence (for example, 
30-P) (Figure 13). Figure 1 4 demonstrates that sequential isolation on TP sepharose 
followed by dTjs agarose produced the same product as isolation on dTjs agarose 
alone. The fact that the in vitro translation product contained both a poly-A tract and 
a free thiol strongly indicated that the translation product was the desired 
10 RNA-peptide fusion. 

The above results are consistent with the ability to synthesize mRNA- 
peptide fusions and to recover them intact from in vitro translation extracts. The 
peptide portions of fusions so synthesized appeared to have the intended sequences as 
demonstrated by immunoprecipitation and isolation using appropriate 
1 5 chromatographic techniques. According to the results presented above, the reactions 
are intramolecular and occur in a template dependent fashion. Finally, even with a 
template modification of less than 1%, the present system facilitates selections based 
on candidate complexities of about 10'^ molecules. 

C-Mvc Epitope Recoverv Selection . To select additional c-myc epitopes, a 
20 large library of translation templates (for example, 10*^ members) is generated 

containing a randomized region (see Figure 7C and below). This library is used to 
generate -10*^ - 10" fusions (as described herein) which are treated with the 
anti-c-myc antibody (for example, by immunoprecipitation or using an antibody 
immobilized on a column or other solid support) to enrich for c-myc-encoding 
25 templates in repeated rounds of in vitro selection. 

Models for Fusion Formation . Without being bound to a particular theory, 
we propose a model for the mechanism of fusion formation in which translation 
initiates normally and elongation proceeds to the end of the open reading frame. 
When the ribosome reaches the DNA portion of the template, translation stalls. At this 
30 point, the complex can partition between two fates: dissociation of the nascent 
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peptide or transfer of the r»ascent peptide to the purotnycm at the 3'-end of the 
Zplate. T^ee£ficier.cyofthe transfer reaction is Ukelyto 

offactors that influence the stabihtyofthe stalled translation complex and the entryof 
the 3--puromycin residue into the A site of the peptidyl transferase center. After the 
, transfer reaction, the n^A-peptide fusion Ukely remains complexed wrth the 

ribosome since thelcnownrelease factors cannothydrolyze the stahle amide hnkage 

between the RNA and peptide domains. 

Both the ctesieal model for elongation (Watson, Boll. Soo. Ctam. Btol. 
46 1399C1964))andthein.ennediatestatesmodel(MoazedandNolle,,Nature 
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30 fusion ptoduet formed increases markedly following extended post-translaho. 
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incubation at low temperature, perhaps because of the increased time available for 
transfer of the nascent peptide to the puromycin. 

nPTATLED MA TRRTALS AND METHODS 
Described below are detailed materials and methods relating to the in vitro 
5 translation and testing of RNA-protein fusions, including fusions having a myc 
epitope tag. 

Sequences. A number of oligonucleotides were used above for the 
generation of RNA-protein fusions. These oligonucleotides have the following 
sequences. 
10 NAME SEQUENCE 

30-P 5'AAAAAAAAAAAAAAAAAAAAAAAAAAACCP(SEQID 

NO:8) 

1 3-P 5'AAA AAA AAA ACC P (SEQ ID NO: 9) 

25-P 5'CGC GGT TTT TAT TTT TTT TTT TCC P (SEQ ID NO: 10) 

15 43-P 5'rGrGrA rGrGrA rCrGrA rArArU rGAA AAA AAA AAA AAA AAA 
AAA AAA AAA ACC P (SEQ ro NO: 11) 

43-P [CUG] 5'rGrGrA rGrGrA iCrGrA rArCrU rGAA AAA AAA AAA AAA 
AAA AAA AAA AAA ACC P (SEQ ID NO: 12) 



20 



40-P 5'rGrGrA rGrGrA rCrGrA rArCrU rGAA AAA AAA AAA AAA AAA 
AAA AAA ACC P (SEQ ID NO: 1 3) 



25 



37-P 5'rGrGrA rGrGrA rCrGrA rArCrU rGAA AAA AAA AAA AAA AAA 
AAA ACC P (SEQ ID NO: 14) 
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34.P 5'rGrGrA rGrGrA rCiGrA rArCrU rGAA AAA AAA AAA AAA AAA 
ACC P (SEQ ID NO: 15) 

3 1 _P 5'rGrGrA rGrGrA rCrGr A rArCrU rGAA AAA AAA AAA AAA ACC P 
(SEQ ID NO: 16) 

5 LP77 5'rGrGrGrArGrGrArCiGrArArArUrGrGrArArCrArGrArArArC 
rUrGrA rUrCrU rCrUrG rArArG rArArG rArCrC rUrGrA r ArC AAA AAA AAA 
AAA AAA AAA AAA AAA AAA CCP (SEQ ID NO: 1) 

LP154 5'rGrGrG rArCrA rArUrU rArCrU rArUrU rUrAiC rArArU rUrArC rA 
rArUrG rGtCrU rGrArA rGrArA rCrArG rArArA rCrUtG rArUrC rUrCrU rGrArA 
10 rGrArA rGtArCrCrUrGrCrUrGrCrGrU rArArA rCrGrUrCrGrU rGrArA rCrArG 

rCrUrG rArArA rCrArC rArArA rCrUrG rGrArA rCrArG rCrUrG rCrGrU rArArC 
rUrCrU rUrGrC rGrCrU AAA AAA AAA AAA AAA AAA AAA AAA AAA CCP 

(SEQIDNO: 3) 

LP160 5' 5-rGrGrG rArCrA rArUrU rAiCrU rArUrU rUrArC rArArU rUrArC rA 
rArUrG rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS 
rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS 
rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rNrNrS rCrArG rCrUrG 
rCrGrU rArArC rUrCrU rUrGrC rGrCrU AAA AAA AAA AAA AAA AAA AAA 
AAA AAA CCP (SEQ ID NO: 17) 



15 



20 



All oligonucleotides are listed in the 5' to 3' direction. Ribonucleotide bases ar^ 
indicated by lower case V prior to the nucleotide designation; P is puromycin; rN 
indicates equal amounts of rA, rG. rC. and rU; rS indicates equal amounts of rG and 
rC; and ail other base designations indicate DNA oligonucleotides. 

ChsmisaiS. Puromycin HCl, long chain alkylamine controlled pore glass, 
25 gougerotin, chloramphenicol, virginiamycin, DMAP. dimethyltrityl chloride, and 
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acetic anhydride were obtained from Sigma Chemical (St. Louis, MO). Pyridine, 
dimethylformamide, toluene, succinic anhydride, and para-nitrophenol were obtained 
from Fluka Chemical (Ronkonkoma, NY). Beta-globin mRNA was obtained from 
Novagen (Madison, WI). TMV RNA was obtained from Boehringer Mannheim 
5 (Indianapolis, IN). 

Enzymes. Proteinase K was obtained from Promega (Madison, WI). 
DNase-free RNAase was either produced by the protocol of Sambrook et al. (supra^ or 
purchased from Boehringer Mannheim. T7 polymerase was made by the published 
protocol of Grodberg and Dunn (J. Bacteriol. 170:1245 (1988)) with the modifications 
10 of Zawadzki and Gross (NucL Acids Res. 19:1948 (1991)). T4 DNA ligase was 
obtained from New England Biolabs (Beverly, MA). 

Quantitation of Radiolabel Incorporation. For radioactive gels bands, the 
amount of radiolabel (^^S or ^^) present in each band was determined by quantitation 
either on a Betagen 603 blot analyzer (Betagen, Waltham, MA) or using 
1 5 phosphorimager plates (Molecular Dynamics, Sunnyvale, CA). For liquid and solid 
samples, the amount of radiolabel (^^S or ^^P) present was determined by scintillation 
counting (Beckman, Columbia, MD). 

Gel Images. Images of gels were obtained by autoradiography (using 
Kodak XAR film) or using phosphorimager plates (Molecular Dynamics). 
20 Synthesis of C PG Puromvcin. Detailed protocols for synthesis of 

CPG-puromycin are outlined above. 

Kn7:vmatic Reactions. In general, the preparation of nucleic acids for 
kinase, transcription, PCR, and translation reactions using coli extracts was the 
same. Each preparative protocol began with extraction using an equal volume of 1 :1 
25 phenol/chloroform, followed by centrifiigation and isolation of the aqueous phase. 
Sodium acetate (pH 5.2) and spermidine were added to a final concentration of 300 
mM and 1 mM respectively, and the sample was precipitated by addition of 3 
volumes of 100% ethanol and incubation at -70**C for 20 minutes. Samples were 
centrifiiged at > 12,000 g, the supernatant was removed, and the pellets were washed 
30 with an excess of 95% ethanol, at O^'C. The resulting pellets were then dried under 
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vacuum and resuspended. 

nn pnnucleotides. All synthetic DNA and RNA was synthesized on a 
Millipore Expedite synthesizer using standard chemistry for each as supplied from the 
manufacturer (MiUigen, Bedford, MA). Oligonucleotides containing 3' puromycin 
5 were synthesized using CPG puromycin columns packed with 30-50 mg of solid 
support (-20 ^mole puromycin/gram). Oligonucleotides containing a 3' biotin were 
synthesized using 1 j.mole bioteg CPG columns from Glen Research (Sterling. VA). 
OHgonucleotides containing a 5' biotin were synthesized by addition of bioteg 
phosphoramidite (Glen Research) as the 5' base. Oligonucleotides to be ligated to the 
10 3- ends of RNA molecules were either chemically phosphorylated at the 5' end (usmg 
chemical phosphorylation reagent from Glen Research) prior to deprotection or 
enzymatically phosphorylated using ATP and T4 polynucleotide kinase (New 
England Biolabs) after deprotection. Samples containing only DNA (and 3' 
puromycin or 3' biotin) were deprotected by addition of 25-/0 NH.OH followed by 
15 incubation for 12 hours at SS-C. Samples containing RNA monomers (e.g.. 43-P) 
were deprotected by addition of ethanol (25«/o (v/v)) to the NH,OH solution and 
incubation for 12 hours at SS^C. The 2'OH was depretected using IM TBAF m THF 
(Sigma) for 48 hours at room temperature. TBAF was removed using a NAP-25 
Sephadex column (Pharmacia, Piscataway, NJ). 
20 Deprotected DNA and RNA samples were then purified using denaturing 

PAGE followed by either soaking or electro-etating from the gel using an Elutrap 
(Schleicher and Schuell. Keene, NH) and desalting using either a NAP-25 Sephadex 
column or ethanol precipitation as described above. 

Hy^^PN A-""*""--*^""- Two DNA templates containing the c-myc 
25 epitope tag were constructed. The first template was made from a combination of the 
oligonucleotides 64.27 (5--GTr CAG GTC TTC TTG AGA GAT CAG TIT CTG 
TTC CAT TTC GTC CTC CCT ATA GTG AGT CGT ATT A-3') (SEQ ID NO: 18) 
and 18.109 (5'-TAA TAC GAC TCA CTA TAG-3-) (SEQ ID NO: 19). Transcription 
using this template produced RNA 47.1 which coded for the peptide 
30 MEQKLISEEDLN(SEQID NO:20). Ligation of RNA 47.1 to 30-P yielded LP77 
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shown in Figure 7A. 

The second template was made first as a single oligonucleotide 99 bases in 
length, having the designation RWR 99.6 and the sequence 5'AGC GCA AGA GTT 
ACG CAG CTG TTC CAG TTT GTG TTT CAG CTG TTC ACG ACG TTT ACG 

5 CAG CAG GTC TTC TTC AGA GAT CAG TTT CTG TTC TTC AGC CAT-3* 
(SEQ ID NO: 21). Double stranded transcription templates containing this sequence 
were constructed by PCR with the oligps RWR 21.103 (5'-AGC GCA AGA GTT 
ACG CAG CTG-30 (SEQ ID NO: 22) and RWR 63.26 (5TAA TAC GAC TCA 
CTA TAG GGA CAA TTA CTA TTT AC A ATT ACA ATG GCT GAA GAA CAG 

10 AAA CTG-3') (SEQ ID NO: 23) according to published protocols (Ausubel et al., 

supra, chapter 15). Transcription using this template produced an RNA referred to as 
RNA124 which coded for the peptide 

MAEEQKLISEEDLLRKRREQLKHKLEQLRNSCA (SEQ ID NO: 24). This 
peptide contained the sequence used to raise monoclonal antibody 9E1 0 when 

1 5 conjugated to a carrier protein (Oncogene Science Technical Bulletin). RNA 1 24 was 
124 nucleotides in length, and ligation of RNA124 to 30-P produced LP154 shown in 
Figure 7B. The sequence of RNA 124 is as follows (SEQ ID NO: 32): 
5'-rGrGrG rArCrA rArUrU rArCrU rArUrU rUrArC rArArU rUrArC rArArUrG 
rGrCrU rGr ArA rGrArA rCrArG rArArA rCrUrG rArUrC rUrCrU rGrArA rGrArA 

20 rGrArC rCrUrG rCrUrG rCrGrU rArArA rCrGrU rCrGrU rGrArA rCrArG rCrUrG 
rArArA rCr ArC rArArA rCrUrG rGrArA rCrArG rCrUrG rCrGrU rArArC rUrCrU 
rUrGrCrGrCrU-3' 

Randomized Pool Construction. The randomized pool was constructed as 
a single oligonucleotide 130 bases in length denoted RWR130.1. Beginning at the 3' 

25 end, the sequence was 3' CCCTGTTAATGATAAATGTTAATGTTAC (NNS)27 
GTC GAC GCA TTG AGA TAC CGA-5' (SEQ ID NO: 25). N denotes a random 
position, and this sequence was generated according to the standard synthesizer 
protocol. S denotes an equal mix of dG and debases. PCR was performed with the 
oligonucleotides 42.108 (5*-TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA 

30 TTT ACA ATT ACA) (SEQ ID NO: 26) and 21.103 (5'-AGC GCA AGA GTT ACG 
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C AG CTG) (SEQ ID NO: 27). Transcription off this template produced an RNA 
denoted pool 130.1 . Ligation of pool 130.1 to 30-P yielded Pool #1 (also referred to as 

LP 160) shown in Figure 7C. 

. Seven cycles of PGR were perfonned according to published protocols 
5 (Ausubel et al., Slipm) with the following exceptions: (i) the starting concentration of 
RWRl 30. 1 was 30 nanomolar, (ii) each primer was used at a concentration of 1.5 nM. 
(iii) the dNTP concentration was 400 ^M for each base, and (iv) the Taq polymerase 
(Boehringer Mannheim) was used at 5 units per 100 jtl. The double stranded product 
was purified on non-denaturing PAGE and isolated by electroelution. The amount of 
10 DNA was determined both by UV absoibance at 260 nm and ethidium bromide 
fluorescence comparison with known standards. 

Tj.-^^^'^rr f='Ypth..i. of RNA. Transcription reactions from double 
stranded PGR DNA and synthetic oUgonucleotides were performed as described 
previously (MilUgan and UWenbeck, Meth. Enzymol. 180:51 (1989)). Full length 
15 RNA was purified by denaturing PAGE, electroeluted. and desalted as descnbed 

above. The pool RNA concentration was estimated using an extinction coefficient of 
1300 O D./nmole; RNA124, 1250 O.D7jimole; RNA 47.1. 480 O.D./nmole. 
Transcription from the double stranded pool DNA produced ~ 90 nanomoles of pool 
RNA. 

20 T:^ ^.f ic. Svnthe -^c »f HTJA-Pnmmvcin Coniugates.. Ligation of the 

myc and pool messenger RNA sequences to the puromycin containing oHgonucleotide 
was performed using a DNA splint, termed 19.35 (S'-TTT TIT TTT TAG GGG AAG 
A) (SEQ ID NO: 28) using a procedure analogous to that described by Moore and 
Sharp (Science 250:992 (1992)). The reaction consisted of mRNA. splint, and 
25 puromycin oligonucleotide (30-P. dA27dGdGP) in a mole ratio of 0.8 : 0.9 : 1.0 and 
1-2 5 unitsofDNAUgase per picomoleofpool mRNA. Reactions were conducted 
for one hour at room temperature. For the constmction of the pool RNA fusions, the 
mRNA concentration was ~ 6.6 ^molar. Following Hgation, the RNA-puromycm 
conjugate was prepared as described above for enzymatic reactions. The precipitate 
30 was resuspended. and full length fusions were purified on denaturing PAGE and 
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isolated by electroelution as described above. The pool RNA concentration was 
estimated using an extinction coefficient of 1650 O.D./^molc and the myc template 
1600 O.D./nmole. In this way, 2.5 nanomoles of conjugate were generated. 

. Preparation of dT,, Streotavidin Agarose. dTjs containing a 3' biotin 

5 (synthesized on bioteg phosphoramidite columns (Glen Research)) was incubated at 
1-10 with a slurry of streptavidin agarose (50% agarose by volume. Pierce, 
Rockford, IL) for 1 hour at room temperature in TE (10 mM Tris Chloride pH 8.2, 1 
mM EDTA) and washed. The binding capacity of the agarose was then estimated 
optically by the disappearance of biotin-dTjs from solution and/or by titration of the 

1 0 resin with known amounts of complementary oligonucleotide. 

Translation Reactions using E. coli D erived Extracts and Ribosomes. In 
general, translation reactions were performed with purchased kits (for example, E. coli 
S30 Extract for Linear Templates, Promega, Madison, WI). However, R cqU 
MRE600 (obtained from the ATCC, Rockville, MD) was also used to generate S30 

1 5 extracts prepared according to published protocols (for example, Ellman et al., Meth. 
EnzymoL 202:301(1991)), as well as aribosomai fraction prepared as described by 
Kudlicki et al. (Anal, Biochem. 206:389 (1992)). The standard reaction was 
performed in a 50 ^1 volume with 20-40 jiCi of ^^S methionine as a marker. The 
reaction mixture consisted of 30% extract v/v, 9-18 mM MgClj, 40% premix minus 

20 methionine (Promega) v/v, and 5 ^M of template (e.g., 43-P). For coincubation 
experiments, the oligos 13-P and 25-P were added at a concentration of 5 \iM. For 
experiments using ribosomes, 3 ^1 of ribosome solution was added per reaction in 
place of the lysate. All reactions were incubated at 37^*0 for 30 minutes. Templates 
were pudfied as described above under enzymatic reactions. 

25 Wheat Germ Translation Reactions. The translation reactions in Figure 8 

were performed using purchased kits lacking methionine (Promega), according to the 
manufacturer's recommendations. Template concentrations were 4 ^M for 43-P and 
0.8 \iM for LP77 and LP154. Reactions were performed at 25^*0 with 30 ^Ci ^^S 
methionine in a total volume of 25 ^1. 

30 Reticulocyte Translation Reactions. Translation reactions were performed 
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either with purchased kits (Novagen, Madison, WI) or usir>g extract prepared 
according to published protocols (Jackson and Hunt. Meth. Enzymol. 96:50 (1983)). 
Reticulocyte-rich blood was obtained from Pel-Freez Biologicals (Rogers. AK). In 
both cases, the reaction conditions were those recommended for use with Red Nova 
5 Lysate (Novagen). Reactions consisted ofl 00 mMKCl. 0.5 mMMgOAc. 2 mM 
DTT 20 mM HEPES pH 7.6, 8 mM creatine phosphate. 25 nM in each ammo ac,d 
(with the exception of methionine if -S Met was used), and 40% v/v of lysate. 
Incubation was at 30»C for 1 hour. Template concentrations depended on the 
experiment but generally ranged from 50 nM to 1 ^M with the exception of 43-P 

1 0 (Figure 6H) which was 4 nM. 

For generation of the randomized pool, 10 ml of translation reaction was 
performed at a template concentration of- 0.1 nM (1.25 nanomoles of template). In 
addition -P labeled template was included in the reaction to allow determmation of 
the amount of material present at each step of the purification and selection procedure. 
1 5 After translation at 30-C for one hour, the reaction was cooled on ice for 30-60 

minutes. . 

with dT, . Str n rT ^ vid i P Ag^S ^ After incubafon, the 

translation reaction was diluted approximately 150 fold into isolation buffer (1.0 M 
NaCl 0 1 M Tris chloride pH 8.2. 10 mM EDTA, 1 mM DTT) containing greater 
20 than a lOX molar excess of dT„-biotin-st^tavidin agarose whose dT. concentratton 
was ~ 10 nM (volume of slurry equal or greater than the volume of lysate) and 
incubated with agitation at 4-'C for one hour. :ibe agarose was then removed from the 
mixture either by filtration (MiUipore ultrafree MC filters) or centrifugation and 
washed with cold isolation buffer 2-4 times. The template was then liberated from the 
dT. streptavidin agarose by repeated washing with 50-100 ^ aliquots of 1 5 mM 
NaOH ImMEDTA. The eluent was immediately neutraUzed in 3M NaOAc pH 5.2, 
1 0 mM spermidine, and was ethanol precipitated. For the pool reaction, the total 
radioactivity recovered indicated approximately 50-70-/0 of the input template was 
recovered. 

»rp».;nn with ThionmPY l ?Sq)har°se. Fusions contaimng 



25 



30 
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cysteine can be purified using thiopropyl sepharose 6B as in Figure 13 (Pharmacia). 
In the experiments described herein, isolation was either carried out directly from the 
translation reaction or following initial isolation of the fusion (e.g., with streptavidin 
agarose). For samples purified directly, a ratio of 1 :10 (v/v) lysate to sepharose was 

5 used. For the pool, 0.5 ml of sepharose slurry was used to isolate all of the fusion 

material from 5 ml of reaction mixture. Samples were diluted into a 50:50 (v/v) slurry 
of thiopropyl sepharose in IX TE 8.2 (10 mM Tris-Cl, 1 mM EDTA, pH 8.2) 
containing DNase free RNase (Boehringer Mannheim) and incubated with rotation for 
1-2 hours at 4° C to allow complete reaction. The excess liquid was removed, and the 

1 0 sepharose was washed repeatedly with isolation buffer containing 20 mM DTT and 
recovered by centrifugation or filtration. The fusions were eluted from the sepharose 
using a solution of 25-30 mM dithiothreitol (DTT) in 10 mM Tris chloride pH 8.2, 1 
mM EDTA. The fusion was then concentrated by a combination of evaporation under 
high vacuum and ethanol precipitation as described above. For the pool reaction, the 

1 5 total radioactivity recovered indicated approximately 1 % of the template was 
converted to fusion. 

For certain applications, dTjj was added to this eluate and rotated for 1 
hour at 4°C. The agarose was rinsed three times with cold isolation buffer, isolated 
via filtration, and the bound material eluted as above. Carrier tRNA was added, and 

20 the fusion product was ethanol precipitated. The sample was resuspended in TE pH 
8.2 containing DNase free RNase A to remove the RNA portion of the template. 

Immunoprecipitation Reactions. Immunoprecipitations of peptides from 
translation reactions (Figure 10) were performed by mixing 4 fil of reticulocyte 
translation reaction, 2 jil nonnal mouse sera, and 20 \i\ Protein G + A agarose 

25 (Calbiochem, La Jolla, CA) with 200 ^1 of either PBS (58 mM Na2HP04, 1 7 mM 

NaH2P04, 68 mM NaCl), dilution buffer (10 mM Tris chloride pH 8.2, 140 mM NaCl. 
1% v/v Triton X-100), or PBSTDS (PBS + l%Triton X-100, 0.5% deoxycholate 
0.1% SDS). Samples were then rotated for one hour at 4°C, followed by 
centrifugation at 2500 rpm for 1 5 minutes. The eluent was removed, and 10 jil of 

30 c-myc monoclonal antibody 9E10 (Calbiochem, La Jolla, CA) and 1 5 ^l of Protein G 
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, A agarose was added and rotated for 2 hours at 4-C. Samples were then washed 
with two 1 ml volumes of either PBS, dilution buffer, or PBSTOS. 40 ^ of gel 
loading buffer (Calbiochem Product Bulletin) was added to the mixture, and 20 ^il 
was loaded on a denaturing PAGE as described by Schagger and von Jagow (Anal. 
5 Biochem. 166:368 (1987)). 

Immunoprecipitations of fusions (as shown in Figure 11 ) were performed 
by mixing 8 ^ of reticulocyte translation reaction with 300 m of dilution buffer (10 
mM Tris chloride pH 8.2, 140 mM NaCl. 1% v/v Triton X-100). 15 ^ protein G 
sepharose (S.gma), and 10 ^1 (1 ^g) c-myc antibody 9B10 (Calbiochem). followed by 
10 rotation for several hours at 4-C. After isolation, samples were washed, treated with 
DNase free RNase A. labeled with polynucleotide kinase and gamma ATP, and 
separated by denaturing urea PAGE (Figure 1 1). 

o.„.... Tr.n.r.rintior Reverse transcription reactions 

were performed according to the manufacturers recommendation for Superscript II. 
1 5 except that the template, water, and primer were incubated at 70°C for only two^ 
minutes (Gibco BRL, Grand Island, NY). To monitor extension, 50 jtCi alpha '^P 
dCTP was included in some reactions; in other reactions, reverse transcription was 
monitored using 5' »P-labeled primers which were prepared using ^^P aATP (New 
England Nuclear, Boston, MA) and T4 polynucleotide kinase (New England B.olabs, 
20 Beverly, MA). 

Protein G .nd ftntibodv 8<^har9ge. Two aliquots of 50 ^1 
Protein G sepharose slurry (50 % solid by volume) (Sigma) were washed with 
dilution buffer (10 mM Tris chloride pH 8.2, 140 mM NaCl, 0.025o/„ Wv 
Triton X-100) and isolated by centrifugation. The first aliquot was reserved for use as 
25 a precolumn prior to the selection matrix. After resuspension of the second ahquot m 
dilution buffer. 40 ^g of c-myc AB-1 monoclonal antibody (Oncogene Science) was 
added and the reaction incubated overnight at 4°C with rotation. Hie antibody 
sepharose was then purified by centrifugation for 15 minutes at 1500-2500 rpm in a 
microcentrifuge and washed 1-2 times with dilution buffer. 
30 SelestisiL After isolation of the fixsion and complementary strand 
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synthesis, the entire reverse transcriptase reaction was used directly in the selection 
process. Two protocols are outlined here. For round one, the reverse transcriptase 
reaction was added directly to the antibody sepharose prepared as described above and 
incubated 2 hours. For subsequent rounds, the reaction is incubated -2 hours with 

5 washed protein G sepharose prior to the antibody column to decrease the number of 
binders that interact with protein G rather than the immobilized antibody. 

To elute the pool from the matrix, several approaches may be taken. The 
first is washing the selection matrix with 4% acetic acid. This procedure Uberates the 
peptide from the matrix. Alternatively, a more stringent washing (e.g., using urea or 

10 another denaturant) may be used instead or in addition to the acetic acid approach. 

PCR of Selected Fusions. Selected molecules are amplified by PGR using 
standard protocols as described above for construction of the pool. 

SYNTHESIS AND TESTIN G OF BETA-GT.OBTN FUSIONS 

To synthesize a P-globin fusion construct, P-globin cDNA was generated 

1 5 from 2.5 ^ig globin mRNA by reverse transcription with 200 pmoles of primer 1 8. 1 55 
(5' GTG GTA TTT GTG AGC CAG) (SEQ ED NO: 29) and Superscript reverse 
transcriptase (Gibco BRL) according to the manufacturer's protocol. The primer 
sequence was complementary to the 18 nucleotides of p-globin 5' of the stop codon. 
To add a T7 promoter, 20 nl of the reverse transcription reaction was removed and 

20 subjected to 6 cycles of PGR with primers 18.155 and 40.54 (5' TAA TAG GAG TGA 
GTA TAG GGA GAG TTG GTT TTG AGA GAA G) (SEQ ID NO: 30). The 
resulting "syn-p-globin" mRNA was then generated by T7 runoff transcription 
according to Milligan and Uhlenbeck (Methods Enzymol. 180:51 (1989)), and the 
RNA gel purified, electrocuted, and desalted as described herein. "LP-P-globin" was 

25 then generated from the syn-P-globin construct by ligation of that construct to 30-P 
according to the method of Moore and Sharp (Science 256:992 (1992)) using primer 
20.262 (5' TTT TTT TTT T GTG GTA TTT G) (SEQ ID NO: 31) as the splint. The 
product of the ligation reaction was then gel purified, electroeluted, and desalted as 
above. The concentration of the final product was determined by absorbance at 260 
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These P-globin templates were then translated in vitm as described in 
Table 1 in a total volume of 25 m each. Mg- was added from a 25 mM stock 
solution. All reactions were incubated at 30°C for one hour and placed at -20''C 
overnight. dT„ precipitable CPM's were then determined twice using 6 m of lysate 
and averaged minus background. 

Translation Reactions with Beta-Globin Templates 
Reaction Template 



10 



Mg^* 
(mM) 


«S Met 
(til) 


TCA CPM 
(2 111) 


dTjs CPM 
(6 til) 


1.0 
0.5 


2.0 (20 nCi) 
2.0 (20 nCi) 


3312 
33860 


0 
36 


1.0 


2.0 (20 jiCi) 


22470 


82 


2.0 


2.0 (20 \iCi) 


15696 


S6 


0.5 


2.0 (20 nCi) 


32712 


218 


1.0 


2.0 (20 ^Ci) 


24226 


402 


2.0 


2.0 (20 >iCi) 


15074 


270 



1 

2 2.5 
syn-P-globin 

3 2.5 HE 
15 syn-P-globin 

4 2.5 
syn-P-globin 

5 2.5 ng 
LP-p-globin 

20 6 2.5 ^ig 

LP-p-globin 

7 2.5 |iig 

LP-P-globin 

To prepare the samples for gel analysis, 6 ^l of each translation reaction 
25 was mixed with 1000 ,1 of Isolation Buffer (1 M NaCl. 100 mM Tris-ClpH 8.2 10 
mM EDTA, 0.1 mM DTT), I ^ RNase A (DNase Free, Boehringer Mannherm), and 
20 m of 20 ^M dT,3 streptavidm agarose. Samples were incubated at 4-C for one 
hour with rotation. Excess lsolat.on Buffer was removed, and the samples were added 

to a MilliporeMC filter to remove any remaining isolation Buffer. Samples were 
30 thenwashedfourtimeswith50.1ofHAandtwicewith50^lofl5mMNaOa^ 
EDTA The sample (300 m) was neutralized with 100 ^ TE pH 6.8 (10 mM 
Tris-Cl. 1 mMEDTA), 1 mof 1 mg/ml RNase A (as above) was added, and the 
samples were incubated at 37°C. 10 ,1 of 2X SDS loading buffer (125 mM Tns-Cl 
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pH 6.8, 2% SDS, 2% P-mercaptoethanol 20% glycerol, 0.001% bromphenol blue) 
was then added, and the sample was lyophilized to dryness and resuspended in 20 ^il 
H2O and 1% P-mercaptoethanol. Samples were then loaded onto a peptide resolving 
gel as described by Schagger and von Jagow (Analytical Biochemistry 166:368 
5 (1 987)) and visualized by autoradiography. 

The results of these experiments are shown in Figures 15A and 15B. As 
indicated in Figure 15 A, ^^S-methionine was incorporated into the protein portion of 
the syn-P-globin and LP-p-globin fusions. The protein was heterogeneous, but one 
strong band exhibited the mobility expected for P-globin mRNA. Also, as shown in 
1 0 Figure 1 5B, after dTis isolation and RNase A digestion, no ^^S-labeled material 
remained in the syn-p-globin lanes (Figure 15B, lanes 2-4). In contrast, in the 
LP-p-globin lanes, a homogeneously sized ^^S-labeled product was observed. 

These results indicated that, as above, a fusion product was isolated by 
oligonucleotide affinity chromatography only when the template contained a 3' 
15 puromycin. This was confirmed by scintillation counting (see Table 1). The material 
obtained is expected to contain the 30-P linker fused to some portion of P-globin. The 
fusion product appeared quite homogeneous in size as judged by gel analysis. 
However, since the product exhibited a mobility very similar to natural P-globin 
(Figures 15A and 15B, control lanes), it was difficult to determine the precise length 
20 of the protein portion of the fusion product. 

TTTTRTHFR OPTIMIZATION OF R NA-PROTEIN FUSION FORMATION 
Certain factors have been found to further increase the efficiency of 
formation of RNA-peptide fusions. Fusion formation, i.e., the transfer of the nascent 
peptide chain from its tRNA to the puromycin moiety at the 3' end of the mRNA, is a 
25 slow reaction that follows the initial, relatively rapid translation of the open reading 
fi-ame to generate the nascent peptide. The extent of fusion formation may be 
substantially enhanced by a post-translational incubation in elevated Mg^^ conditions 
(preferably, in a range of 50-100 mM) and/or by the use of a more flexible linker 
between the mRNA and the puromycin moiety. In addition, long incubations (12-48 
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hours) at low temperatures (preferably, 

-20°C) also result in increased yields effusions with less mRNA degradation than 
that which occurs during incubation at 30°C. By combining these factors, up to 40% 
of the input mRNA may be converted to mRNA-pqjtide fusion products, as shown 
5 below. 

Synthesis of mBNA-Pnm Tnyr.in Cnniugates. In these optimization 
experiments, puromycin-containing linker oligonucleotides were Ugated to the 3* ends 
of mRNAs using bacteriophage T4 DNA ligase in the presence of complementary 
DNA splints, generally as described above. Since T4 DNA ligase prefers precise 
10 base-pairing near the Ugation junction and run-off transcription products with T7, T3, 
or SP6 RNA polymerase are often heterogeneous at their 3' ends (Nucleic Acids 
Research 15:8783 (1987)), only those RNAs containing the conrect 3'-terminal 
nucleotide were efficiently ligated. When a standard DNA splint was used, 
approximately 40% of runoff transcription products were ligated to the puromycin 
15 oligo. The amount of Ugation product was increased by using excess RNA, but was 
not increased using excess puromycin oligonucleotide. Without being bound to a 
particular theory, it appeared that the limiting factor for hgation was the amount of 
RNA which was fiilly complementary to the con-esponding region of the DNA splint. 

To allow ligation of those transcripts ending with an extra non-templated 
20 nucleotide at the 3' terminus (termed "N+1 products"), a mixture of the standard DNA 
splint with a new DNA splint containing an additional random base at the Ugation 
junction was used. The Ugation efficiency increased to more than 70% for an 
exemplary myc RNA template (that is, RNA124) in the presence of such a mixed 
DNA splint. 

25 In addition to this modified DNA splint approach, the efficiency of 

mRNA-puromycin conjugate fomiation was also finther optimized by taking into 
account the following three factors. First, mRNAs were preferably designed or 
utilized which lacked 3'-termini having any significant, stable secondary stracture that 
would interfere with annealing to a splint oUgonucleotide. In addition, because a high 

30 concentration of salt sometimes caused failure of the Ugation reaction, thorough 
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desalting of the oligonucleotides using NAP-25 columns was preferably included as a 
step in the procedure. Finally, because the ligation reaction was relatively rapid and 
was generally complete within 40 minutes at room temperature, significantly longer 
incubation periods were not generally utilized and often resulted in unnecessary 
5 degradation of the RNA. 

Using the above conditions, mRNA-puromycin conjugates were 
synthesized as follows. Ligation of the myc RNA sequence (RNA124) to the 
puromycin-containing oligonucleotide was performed using either a standard DNA 
splint (e.g., 5'-TTTTTTTTTTAGCGCAAGA) or a splint containing a random base 
10 (N) at the ligation junction (e.g., 5'-TTTTTTTTTTNAGCGC AAGA). The reactions 
consisted of mRNA, the DNA splint, and the puromycin oligonucleotide in a molar 
ratio of 1.0 : 1 .5-2.0 : 1 .0. A mixture of these components was first heated at 94°C 
for 1 minute and then cooled on ice for 15 minutes. Ligation reactions were 
performed for one hour at room temperatixre in 50 mM Tris-HCl (pH 7.5), 10 mM 
15 MgClj, 10 mM DTT, 1 mM ATP, 25 ^ig/ml BSA, 15 ^M puromycin oligo, 15 \iM 
mRNA, 22.5-30 ^M DNA splint, RNasin inhibitor (Promega) at 1 U/^il, and 1.6 units 
of T4 DNA ligase per picomole of puromycin oligo. Following incubation, EDTA 
was added to a final concentration of 30 mM, and the reaction mixtures were 
extracted with phenol/chloroform. Full length conjugates were purified by denaturing 
20 PAGE and isolated by electroelution. 

General Reticulocyte Translation Conditions . In addition to improving the 
synthesis of the mRNA-puromycin conjugate, translation reactions were also fiuther 
optimized as follows. Reactions were performed in rabbit reticulocyte lysates from 
different commercial sources (Novagen, Madison, WI; Amersham, Arlington Heights, 
25 IL; Boehringer Mannheim, IndianapoHs, IN; Ambion, Austin, TX; and Promega, 
Madison, WI). A typical reaction mixture (25 ^1 final volume) consisted of 20 mM 
HEPES pH 7.6, 2 mM DTT, 8 mM creatine phosphate, 100 mM KCl , 0.75 mM 
Mg(OAc)2, 1 mM ATP, 0.2 mM GTP, 25 \iM of each amino acid (0.7 \iM methionine 
if ^^s-Met was used), RNasin at 1 U/^l, and 60% (v/v) lysate. The final concentration 
30 of template was in the range.of 50 nM to 800 nM. For each incubation, all 
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components except lysate were mixed carefully on ice, and the frozen lysate was 
thawed immediately before use. After addition of lysate, the reaction mixture was 
mixed thoroughly by gentle pipetting and incubated at 30«>C to start translation. THe 
optimal concentrations of Mg^^ and varied within the ranges of 0.25 mM - 2 mM 
5 and 75 mM - 200 mM. respectively, for different mRNAs and was preferably 

determined in preliminary experiments. Particularly for poorly translated mRNAs, 
the concentrations of hemin. creatine phosphate, tRNA, and amino acids were also 
sometimes optimized. Potassium chloride was generally preferred over potassium 
acetate for fusion reactions, but a mixture of KCl and KOAc sometimes produced 

10 better results. 

After translation at SCC for 30 to 90 minutes, the reaction was cooled on 
ice for 40 minutes, and Mg^^ was added. The fmal concentration of Mg^* added at 
this step was also optimized for different mRNA templates, but was generally in the 
range of 50 mM to 100 mM (with 50 mM being preferably used for pools of mixed 
15 templates). The resulting mixture was incubated at -20-C for 16 to 48 hours. To 

visuaUze the labeled fusion products. 2 jil of the reaction mixture was mixed with 4 jil 
loading buffer, and the mixture was heated at 75''C for 3 minutes. The resulting 
mixture was then loaded onto a 6% glycine SDS-polyacrylamide gel (for -P-labeled 
templates) or an iVo tricine SDS-polyacrylamide gel (for -S-Met-labeled templates). 
20 As an alternative to this approach, the fusion products may also be isolated using dT,, 
streptavidin agarose or thiopropyl sepharose (or both), generally as described herem. 

To remove the RNA portion of the RNA-linker-puromycin-peptide 
conjugate for subsequent analysis by SDS-PAGE, an appropriate amount ofEDTA 
was added after post-translational incubation, and the reaction mixture was desalted 
25 using a microcon-10 (or microcon-30) column. 2 ^l of the resulting mixture 

(approximately 25 ^1 total) was mixed with 1 8 ^1 of RNase H buffer (30 mM Tris- 
HCl, pH 7.8. 30 mM (NH,).SO,. 8 mM MgCl,, 1.5 mM P-mercaptoethanol. and an 
appropriate amount of complementary DNA splmt), and the mixture was incubated at 
4°C for 45 minutes. RNase H was then added, and digestion was perfomied at 37°C 
30 for 20 minutes. 
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Oualitv of Puromvcin Oligo . The quality of the puromycin 
oligonucleotide was also important for the efBcient generation of fusion products. 
The coupling of 5*-DMT, 2*-succinyl, N-trifluoroacctyl puromycin with CPG was not 
as efficient as the coupling of the standard nucleotides. As such, the coupling reaction 
5 was carefully monitored to avoid the formation of CPG with too low a concentration 
of coupled puromycin, and unreacted amino groups on the CPG were fully quenched 
to avoid subsequent synthesis of oligonucleotides lacking a 3'-tenninal puromycin. It 
was also important to avoid the use of CPG containing very fine mesh particles, as 
these were capable of causing problems with valve clogging during subsequent 
1 0 automated oligonucleotide synthesis steps. 

In addition, the synthesized puromycin oligo was preferably tested before 
large scale use to ensure the presence of puromycin at the 3' end. In our experiments, 
no fusion was detected if puromycin was substituted vnih a deoxyadenosine 
containing a primary amino group at the 3* end. To test for the presence of 3* 
1 5 hydroxyl groups (i.e., the undesired synthesis of oligos lacking a 3'-terminal 
puromycin), the puromycin oligo may first be radiolabeled (e.g., by 5 - 
phosphorylation) and then used as a primer for extension with terminal 
deoxynucleotidyl transferase. In the presence of a 3'-terminal puromycin moiety, no 
extension product should be observed. 
20 Time Course of Translation and Post-Translat ional Incubation. The 

translation reaction was relatively rapid and was generally completed within 25 
minutes at 30 **C. The fusion reaction, however, was slower. When a standard linker 
(dAzTdCdCP) was used at 30 °C, fusion synthesis reached its maximum level in an 
additional 45 minutes. The post-translational incubation could be carried out at lower 
25 temperatures, for example, room temperature, C^C, or -20'*C. Less degradation of the 
mRNA template was observed at -20**C, and the best fusion results were obtained 
after incubation at -20*'C for 2 days. 

The Effect of M^^^ Concentration . A high concentration of Mg^* in the 
post-translational incubation greatly stimulated fusion formation. For example, for 
30 the myc RNA template described above, a 3-4 fold stimulation of fusion formation 
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was observed using a standard linker (dA.^CdCP) in the presence of 50 mM Mg^^ 
during the 1 6 hour incubation at -20°C (Figm. 17. compare lanes 3 and 4). SxmUarly, 
efficient fusion formation was also observed using a post-translational incubation m 
the presence of a 50-100 mM Mg^^ concentration when the reactions were carried out 
5 at room temperature for 30-45 minutes. 

,^y.,^^^..r..^^n^cc. Hxedependenceofthefusionreactionon 

the lengthofthelinkerwas also examined. In the range between 21 and 30 
nucleotides(n=18-27), little change was seen in the efficiency oftixefi^ionreaction 

(as described above). Shorter linkers (e.g.. 1 3 nucleotides in length) resulted in lower 
10 fusion. Inaddition,althoughparticularlinkersofgreaterlength(tiiatis,of45 

nucleotides and 54 nucleotides) also resulted in somewhat lower fosion efficiences U 
remains likely that yet longer linkers may alsobe used to optimize the efficiency of 

the fusion reaction. 

With respect to linker sequence, substitution of deoxyribonucleotide 
1 5 residues near the 3' end with ribonucleotide residues did not significantly change the 
fusionefficiency. The dCdCP (or rCrCP) sequence at the 3' end of the linker was, 
.owever, important to fUsion formation. Substitution of dCdCP with dUdUP reduced 
the efficiency of ftision formation significantiy. 

dependence of the fiasion reaction on the flexxb.hty 
20 ofthelinkerwasalsotested. In these experiments, it was determined that the fission 
efficiency was low if the rigidity of ti.e linker was increased by annealing w.tix a 
complementaryoligonucleotidenearthe3-end.Similarly,whenamo^fl^^^^^^ 

linker (for example, dA,.CC,C,dAdCdCP. where C, represents HO(CH.CH.O)3PO.) 
.as used, ti.e fusionefficiency was significantly improved. Compared to the standard 

25 linker(dA.^CdCP).«seofti.emoreflexiblelinker(dA.C,C,C,dAdCdCP)improved 

the fusion efficiency for RNA124more than 4-fold (Figure IT.comparelanesland 

9) In addition, in contrast to the template with the standard linker whose post- 
translationfusionproceededpoorlyintheabsenceofahighconcentratio^ 
(Figure 1 7, lane 3 and 4). ti.e template witi. the flexible linker did not require elevated 
30 Mg- to produce a good yield of fiasion product in an extended post-translational 
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incubation at -20°C (Figure 17, compare lanes 1 1 and 12). This linker, therefore, was 
very useful if post-translational additions of high concentrations of Mg^^ were not 
desired. In addition, the flexible linker also produced optimal fusion yields in the 
presence of elevated Mg^*. 

5 Quantitation of Fusion EfTiciencv . Fusion efficiency may be expressed as 

either the fraction of translated peptide converted to fusion product, or the fraction of 
input template converted to fiision product. To determine the fraction of translated 
peptide converted to fusion product, ^^S-Met labeling of the translated peptide was 
utilized. In these experiments, when a dA27dCdCP or dA27rCrCP linker was used, 

1 0 about 3.5% of the translated peptide was fused to its mRNA after a 1 hour translation 
incubation at 30°C. This value increased to 12% after overnight incubation at -20"* C. 
When the post-translational incubation was carried out in the presence of a high 
concmtration of Mg^*, more than 50% of the translated peptide was fused to the 
template. 

1 5 For a template with a flexible linker, approximately 25% of the translated 

peptide was fused to the template after 1 hour of translation at BO'^C. This value 
increased to over 50% after overnight incubation at -20 °C and to more than 75% if 
the post-translational incubation was performed in the presence of 50 mM Mg**. 

To determine the percentage of the input template converted to fusion 

20 product, the translations were performed using ^^p-labeled mRNA-linker template. 

When the flexible linker was used and post-translational incubation was performed at 
-20'^C without addition of Mg^*, about 20%, 40%, 40%, 35%, and 20% of the input 
template was converted to mRNA-peptide fusion when the concentration of the input 
RNA template was 800, 400, 200, 100, and 50 nM, respectively (Figure 18). Similar 

25 results were obtained when the post-translational incubation was performed in the 

presence of 50 mM Mg^*. The best results were achieved using lysates obtained from 
Novagen, Amersham, or Ambion (Figure 19). 

The mobiUty differences between mRNAs and mRNA-peptide fusions as 
measured by SDS-PAGE may be very small if the mRNA template is long. In such 

30 cases, the template may be labeled at the 5' end of the linker with ^^P. The long RNA 
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portion may then be digested with RNase H in the presence of a complementary DNA 
splint after translation/incubation, and the fusion efficiency determined by 
quantitation of the ratio of unmodified linker to linkcr-peptide fusion. Compared to 
RNase A digestion, which produces 3--P and S'-OH, this approach has the advantage 
5 that the ^'P at the 5' end of the linker is not removed. 

j,,,^^. in addition to the above experiments, we tested whether the fusion 
reaction that occurred at -lO^C in the presence of Mg- was intra- or intermolecular m 
nature. Free linker (dA^^CdCP or dA.CCQdAdCdCP, where C, ts - 
10 0(CH,CH,O)3PO.-) was coincubated with atemplate containing a DNA Imker. but 
without puromycin at the 3" end. under the translation and post-translational 
incubation conditions described above. In these experiments, no detectable amount 
(that is less than 2% of the normal level) of -S-Met was incorporated mto Imker- 
peptide product, suggesting that post-translational fusion occurred primarily between 
15 the nascent peptide and the mRNA bound to the same ribosome. 

^.,.....onl.esults. AS illustrated above, by usingthe flexible hnker 
and/or performing the post-translational incubation in the presence of a high 
concentration of Mg^% fusion efficiencies were increased to approximately 40o/„ of 
input mRNA. lUese results indicated that as many as 10- molecules of mRNA- 
20 peptidefusioncouldbegeneratedpermlofmyitmtranslationreact.onmxx, 

producing pools of mRNA-peptide fusions of very high complexity for use in m Bim 
selection experiments. 

^.r^r^rv T^KRTC m^FMT OF PN^-PBOTF^ FUSIONS 
We have demonstrated the feasibility of using RNA-peptide fusions m 
25 selectionandevolutionexperimentsbyenrichingaparticularRNA-peptidefUsion 

from a complex pool of random sequence fusions on the basis of the encoded pept.de. 
in particular, we prepared a series of mixtures in which a small quantity of known 
sequence (in this case, the long myc template, LPl 54) was combined with some 
amount of random sequence pool (that is. LP160). These mixtures were translated. 
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and the RNA-peptide fusion products selected by oligonucleotide and disulfide 
affinity chromatography as described herein. The myc-template fusions were 
selectively immunoprecipitated with anti-myc monoclonal antibody (Figure 16A). To 
measure the enrichment obtained in this selective step, aiiquots of the mixture of 

5 cDNA/mRNA-peptide fusions from before and after the immunoprecipitation were 
amplified by PGR in the presence of a radiolabeled primer. The amplified DNA was 
digested with a restriction endonuclease that cut the myc template sequence but not 
the pool (Figures 16B and 16C). Quantitation of the ratio of cut and uncut DNA 
indicated that the myc sequence was enriched by 20-40 fold relative to the random 

1 0 library by immunoprecipitation. 

These experiments were carried out as follows. 

Translation Reactions . Translation reactions were performed generally as 
described above. Specifically, reactions were performed at 30**C for one hour 
according to the manufacturer's specifications (Novagen) and frozen overnight at 
1 5 -20'*C. Two versions of six samples were made, one containing methionine and 
one containing cold methionine added to a final concentration of 52 ^M. Reactions 
1-6 contained the amounts of templates described in Table 2. All numbers in Table 2 
represent picomoles of template per 25 \il reaction mixture. 



TABLED 

20 Template Ratios Used in Doped Selection 

Reaction LP154 LP160 

2 5 

3 1 20 
25 4 0.1 20 

5 0.01 20 

6 — 20 



30 



Pre paration of dT. , Streptavidin Agarose. Streptavidin agarose (Pierce) 
was washed three times with TE 8.2 (10 mM Tris Cl pH 8.2, 1 mM EDTA) and 
resuspended as a 1 : 1 (v/v) slurry in TE 8.2. 3' biotinyl T25 synthesized using Bioteg 
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CPG (Glen Research) was then added to the desired final concentration (generally 10 
or 20 nM), and incubation was carried out with agitation for 1 hour. The dTj, 
streptavidin agarose was then washed three times with TE 8.2 and stored at 4°C until 



use. 



10 



Pl^^fi ^^tinr, of T ^pl^t^^ fmm Translation Reactions. To purify templates 
from translation reactions, 25 nl of each reaction was removed and added to 7.5 ml of 
Isolation Buffer (1 M NaCl. 100 mM Tris Cl pH 8.2, 10 mM EDTA. 0.1 mM DTT) 
and 125 nl of 20 fiM dT„ streptavidin agarose. This solution was incubated at rc 
for one hour with rotation. The tubes were centrifuged and the eluent removed. One 
ml of Isolation Buffer was added, the slurry was resuspended, and the mixtures were 
transferred to 1 .5 ml microcentrifuge tubes. The samples were then washed four 
times with I ml aliquots of ice cold Isolation Buffer. Hot and cold samples from 
identical reactions were then combined in a filter Millpore MC filter unit and were 
eluted from the dT« agarose by washmg with 2 volumes of 100 ^l HA 0.1 mM DTT, 
15 and 2 volumes of 1 5 mM NaOH. 1 mM EDTA. 

To this eluent was added 40 ^1 of a 50% sluny of washed thiopropyl 
sephaxose (Pharmacia), and incubation was carried out at A'-C with rotation for 1 hour. 
The samples were then washed with three 1 ml volumes of TE 8.2 and the eluent 
removed. One jil of 1 M DTT was added to the soUd (total volume approximately 20- 
20 30 ^1), and the sample was incubated for several hours, removed, and washed four 
times with 20 nl H^O (total volume 90 nl). The eluent contained 2.5 mM 
thiopyridone as judged by UV absoibance. 50 »il of this sample was ethanol 
precipitated by adding 6 jil 3 M NaOAc pH 5.2, 10 mM spermine, 1 ^l glycogen (10 
mg/ml. Boehringer MamAeim), and 170 jtl 100% EtOH, incubating for 30 minutes at 
25 -70°C. and centrifuging for 30 minutes at 13,000 rpm in a microcentrifuge. 

Tr:.nscri pt;.^e Reactions. Reverse transcription reactions were 
performed on both the ethanol precipitated and the thiopyridone eluent samples as 
follows. For the ethanol precipitated samples, 30 ^l of resuspended template, H^O to 
48 jil, and 200 picomoles of primer 21.103 (SEQ ID NO: 22) were amiealed at 70»C 
30 for 5 minutes and cooled on ice. To this sample, 16 ^1 of first strand buffer (250 mM 
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Tris-Cl pH 8.3, 375 mM KCl, and 15 mM MgClj; available from Gibco BRL, Grand 
Island, NY), 8 jil 100 mM DTT, and 4 |il 10 mM NTP were added and equilibrated at 
42'*C, and 4 |il Superscript II reverse transcriptase (Gibco BRL, Grand Island, NY) 
was added. H2O (13 ^1) was added to the TP sepharose eiuent (35 ^l), and reactions 

5 were performed as above. After incubation for one hour, like numbered samples were 
combined (total volume 160 p.1). 10 ^il of sample was reserved for the PCR of each 
unselected sample, and 150 ^1 of sample was reserved for immunoprecipitation. 

Immunoprecipitation . To carry out immunoprecipitations, 170 ^1 of 
reverse transcription reaction was added to 1 ml of Dilution Buffer (10 mM Tris-Cl, 

10 pH 8.2, 140 mM NaCl, 1% v/v Triton X-100) and 20 ^il of Protein G/A conjugate 
(Calbiochem, La Jolla, CA), and precleared by incubation at A'^C with rotation for 1 
hour. The eiuent was removed, and 20 \il G/A conjugate and 20 jil of monoclonal 
antibody (2 ^g, 12 picomoles) were added, and the sample incubated with rotation for 
two hours at 4°C. The conjugate was precipitated by microcentrifligation at 2500 rpm 

15 for 5 minutes, the eiuent removed, and the conjugate washed three times with 1 ml 
aliquots of ice cold Dilution Buffer. The sample was then washed with 1 ml ice cold 
10 mM Tris Cl, pH 8.2, 100 mM NaCl. The bound fragments were removed using 3 
volumes of frozen 4% HOAc, and the samples were lyophilized to dryness. 

PCR of Selected and Unselected Samples . PCR reactions were carried out 

20 by adding 20 \i\ of concentrated NH4OH to 10 ^il of the unselected material and the 
entirety of the selected material and incubating for 5 minutes each at 55 °C, 70** C, and 
90''C to destroy any RNA present in the sample. The samples were then evaporated 
to dryness using a speedvac. 200 jil of PCR mixture (1 \iM primers 21.103 and 
42.108, 200 ^iM dNTP in PCR buffer plus Mg^* (Boehringer Mannheim), and 2 ^1 of 

25 Taq polymerase (Boehringer Mannheim)) were added to each sample. 16 cycles of 
PCR were performed on unselected sample number 2, and 19 cycles were performed 
on all other samples. 

Samples were then amplified in the presence of 5' "P-Iabeled primer 
21.103 according to Table 3, and purified twice individually using Wizard direct PCR 

30 purification kits (Promega) to remove all primer and shorter fragments. 
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TABLE 3 

Amplification of Selected and Unselected PCR Samples 



10 



15 



20 



25 



30 



Sample 


Type 


Volume 


Cycles 


1 


unselected 


20 \i\ 


5 


2 


imselected 


Sill 


4 


3 


unselected 


20 ^l 


5 


4 


unselected 


20 ^L\ 


5 


5 


imselected 


20 ii\ 


5 


6 


unselected 


20 \i\ 


5 


1 


selected 


20 Hi 


5 


2 


selected 


5 111 


4 


3 


selected 


20 nl 


5 


4 


selected 


20 Hi 


7 


5 


selected 


20 \i\ 


7 


6 


selected 


20 \i\ 


7 



T?>.ctrir.tinn Digests. labeled DNA prepared fiom each of the above 
PCR reactions was added in equal amounts (by cpm of sample) to restriction digest 
reactions according to Table 4. The total volume of each reaction was 25 nl. 0.5^1 
of Alwnl (5 units. New England Biolabs) was added to each reaction. Samples were 
incubated at 37°C for 1 hour, and the enzyme was heat inactivated by a 20 minute 
incubation at 65°C. The samples were then mixed with 10 jil denaturing loading 
buffer (1 ml ultrapure formamide (USB), 20 m 0.5 M EDTA, and 20 ^ 1 M NaOH), 
heated to 90°C for 1 minute, cooled, and loaded onto a 12% denaturing 
polyacrylamide gel containing 8M urea. Following electrophoresis, the gel was fixed 

with 10% (v/v) HOAc, 10% (v/v) MeOH, HjO. 

TABLE 4 

Restriction Digest Conditions w/ Alwnl 

sample Type Volume DNA Total volume 

added to reaction 

unselected 20 nl 25 }i\ 

unselected 4 \i\ 25 nl 

unselected 20 jil 25 \il 

unselected 20 ^il 25 ]i\ 

unselected 4 \i\ 25 nl 



1 

2 
3 
4 

35 5 
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6 


unselected 


20 \t\ 


25 nl 


1 


selected 


20 ^1 


25 nl 


2 


selected 


8(il 


25 ^l 


3 


selected 


12^1 


25 Hi 


4 


selected 


12 ^l 


25 \i\ 


5 


selected 


20 ^1 


25 \i\ 


6 


selected 


20^1 


25 \il 



Quantitation of Digest . The amount of myc versus pool DNA present in a 
sample was quantitated using a phosphorimager (Molecular Dynamics). The amount 

10 of material present in each band was determined as the integrated volume of identical 
rectangles drawn around the gel bands. The total cpm present in each band was 
calculated as the volume minus the background. Three values of background were 
used: (1) an average of identical squares outside the area where counts occurred on 
the gel; (2) the cpm present in the unselected pool lane where the myc band should 

15 appear (no band appears at this position on the gel); and (3) a normahzed value that 
reproduced the closest value to the 10-fold template increments between unselected 
lanes. Lanes 2, 3, and 4 of Figures 16B and 16C demonstrate enrichment of the target 
versus the pool sequence. The demonstrable enrichment in lane 3 
(unselected/selected) yielded the largest values (17, 43, and 27 fold using methods 

20 1-3, respectively) due to the optimization of the signal to noise ratio for this sample. 
These results are sxmmiarized in Table 5. 

TABLE 3 

Enrichment of Myc Template vs. Pool 



Method Lane 2 (20) Lane 3 (200) Lane 4 (2000) 

25 1 7.0 16.6 5.7 

2 10.4 43 39 

3 8.7 27 10.2 



In a second set of experiments, these same PCR products were purified 
once using Wizard direct PCR purification kits, and digests were quantitated by 
30 method (2) above. In these experiments, similar results were obtained; enrichments of 
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10.7, 38, and 12 fold, respectively, were measured for samples equivalent to those in 
lanes 2, 3, and 4 above. 

M ^V OF PROTE T^T ^^T fryn^ SYSTEMS 
The selection systems of the present invention have commercial 
5 applications in any area where protein technology is used to solve therapeutic, 

diagnostic, or industrial problems. This selection technology is useful for improving 
or altering existing proteins as well as for isolating new proteins with desired 
functions. These proteins may be naturally-occurring sequences, may be altered 
forms of naturally-occurring sequences, or may be parUy or fully synthetic sequences. 
10 Te^i.tinn nf Nove l pinHma Reagents. In one particular application, the 

RNA-protcin fusion technology described herein is useful for the isolation of proteins 
with specific binding (for example, ligand binding) properties. Proteins exhibiting 
highly specific binding interactions may be used as non-antibody recognition 
reagents, allowing RNA-protein fusion technology to circumvent traditional 
15 monoclonal antibody technology. Antibody-type reagents isolated by this method 

may be used in any area where traditional antibodies are utilized, including diagnostic 

and therapeutic applications. 

T,^r— H"tn.n Antibodies. The present invention may also be 
used to improve human or humanized antibodies for the treatment of any of a number 
20 of diseases. In this application, antibody libraries are developed and are screened in 
vitro, eliminating the need for techniques such as cell-fusion or phage display. In one 
important application, the invention is useful for improving single chain antibody 
libraries (Ward et al.,Nature 341:544 (1989); and Goulot et al., J. Mol. Biol. 213:617 
(1 990)). For this application, the variable region may be constructed either from a 
25 human source (to minimize possible adverse immune reactions of the recipient) or 

may contain a totally randomized cassette (to maximize the complexity of the libi«y). 
To screen for improved antibody molecules, a pool of candidate molecules are tested 
for binding to a target molecule (for example, an antigen immobiUzed as shown in 
Figure 2). Higher levels of stringency are then applied to the binding step as the 
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selection progresses from one round to the next. To increase stringency, conditions 
such as number of wash steps, concentration of excess competitor, buffer conditions, 
length of binding reaction time, and choice of immobilization matrix are altered. 
Single chain antibodies may be used either directly for therapy or 
5 indirectly for the design of standard antibodies. Such antibodies have a nimiber of 
potential applications, including the isolation of anti-autoimmune antibodies, immune 
suppression, and in the development of vaccines for viral diseases such as AIDS. 

Tsolation of New Catalysts . The present invention may also be used to 
select new catalytic proteins. In vitro selection and evolution has been used 
10 previously for the isolation of novel catalytic RNAs and DNAs, and, in the present 
invention, is used for the isolation of novel protein enzymes. In one particular 
example of this approach, a catalyst may be isolated indirectly by selecting for 
binding to a chemical analog of the catalyst's transition state. In another particular 
example, direct isolation may be carried out by selecting for covalent bond formation 
1 5 with a substrate (for example, using a substrate linked to an affmity tag) or by 

cleavage (for example, by selecting for the ability to break a specific bond and thereby 
liberate catalytic members of a library from a solid support). 

This approach to the isolation of new catalysts has at least two important 
advantages over catalytic antibody technology (reviewed in Schultz et aL, J. Chem. 
20 Engng. News 68:26 (1990)). First, in catalytic antibody technology, the initial pool is 
generally limited to the immunoglobulin fold; in contrast, the starting library of 
RNA-protein fusions may be either completely random or may consist, without 
limitation, of variants of known enzymatic structures or protein scaffolds. In addition, 
the isolation of catalytic antibodies generally relies on an initial selection for binding 
25 to transition state reaction analogs followed by laborious screening for active 
antibodies; again, in contrast, direct selection for catalysis is possible using an 
RNA-protein fusion library approach, as previously demonstrated using RNA 
libraries. In an alternative approach to isolating protein enzymes, the 
transition-state-analog and direct selection approaches may be combined. 
30 Enzymes obtained by this method are highly valuable. For example, there 
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10 



15 



cogently exists a pvesstag n«d for nove, and effecUv. Wustti.1 c-alysts tt»l allow 
™p„,ved chem-ca. p^oesses .o be developed. A major «lvanag. of .be tavenbon .s 
U«, selecbons may be earned out in arbib^ conditions »>d are no. lim,.ed, for 
example, .o io ^ eonditions. The invention U^efore f«>iUb^ d,e isolaton of 
™vel enzymes or improved variant of existing enzymes ean c»,y ou, tagUy 
specific ttansformarions (a«l hereby minimize fl.e fom««ion of .ndesned 
byp,oduc«) while fumtioning in predetem-inedem-ironmaits. for example. 
e„vi«,mne„uofeleva.edtempera.»re.pressure.orsolven.concenttanon. 

j„., ,f;.,„,„,.„...i„nTraD. The RNA-prottin fusion .eebnology .s also 
useful tor screening cDNA libraries and cloning new genes on die basis of 
p,„,ein.pro.ein in«racUo„s. By U.is med»d. a cDNA libra,, is genera«d a 
desired source (for example, by .he m«hod of Ausubel « al.. supra. chap«r 5). To 
.a„h of d.e candidate cDNAs..pep«deaccep»r(fbr example. asapmomycmU-Ou 

„ga.ed (for example, using d,e techniques descrthed above for «,e genenfon of IP77 
1:^,54 andLP160,. RNA-pn>tein fbsions are then gen«ated as described berc.n. a.,J 
,he ability ofU,esefi.si«.s(or improve! v^o,»otthe«ons)tointerac.wi,h 
particular moleculesistbent^edas described above. If desired. «opcoaonsand3 
im. regions may be avoided in this process by either (i)addmg suppressor tBKAto 

allow readthrough of the stop regions, (ii) removing tt» release fecor torn «,e 
, ™s,a.ionreac.ionbyimmunoprecipi.a.ion.(iiOac„mbinaao„of(i)«..(n>.or(.v) 

removal ord.es.opcodo„s»«. 3- UTRfiom the DNA sequences. 

The fact that the interaction step takes place injte allows careful control 

of the .^.ction stringency, using «,nspecif.c comp«itor. temperature, and .onto 
„„di.i„ns.Al.emtio„ofno,malsma.lmol««.eswithnon-h,drolyzablea^logs 
,5 ,eg ATPvs.ATTgS)provides6.rselectio»>.hatdiscriminatebe»veend,ffcrent 

■ eonf^tmersotthesamemoiccule. This approach is useM for both d,e cU,nmg»^ 
(.uctionalidentificMi^tofmanyproteim since theBNAseqnenceofthe selected 

binding partner is eovalently attached and may the«fore be readily isolated. In 
addition, the technique is usefitl for idcnUlying functions and interactions of tiie 
30 -50-100.000 human genes, whose sequences are currentiy being dctem,med by the 
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Human Genome project. 

USE OF RNA.P ROTETN FUSIONS TN A MICROCHIP FORMAT 
"DMA chips" consist of spatially defined arrays of immobilized 
oligonucleotides or cloned fiagments of cDNA or genomic DNA, and have 

5 applications such as rapid sequencing and transcript profiling. By annealing a 

mixture of RNA-protein fusions (for example, generated from a cellular DNA or RNA 
pool), to such a DNA chip, it is possible to generate a "protein display chip." in which 
each spot corresponding to one immobilized sequence is capable of annealing to its 
corresponding RNA sequence in the pool of RNA-protein fusions. By this approach, 

1 0 the corresponding protein is immobilized in a spatially defined manner because of its 
linkage to its own mRNA, and chips containing sets of DNA sequences display the 
corresponding set of proteins. Altemativcly, peptide fi-agments of these proteins may 
be displayed if the fusion library is generated from smaller fragments of cDNAs or 
genomic DNAs. 

1 5 Such ordered displays of proteins and peptides have many uses. For 

example, they represent powerful tools for the identification of previously unknown 
protein-protein interactions. In one specific format, a probe protein is detectably 
labeled (for example, with a fiuorescent dye), and the labeled protein is incubated 
with a protein display chip. By this approach, the identity of proteins that are able to 

20 bind the probe protein are determined irom the location of the spots on the chip that 
become labeled due to binding of the probe. Another application is the rapid 
determination of proteins that are chemically modified through the action of 
modifying enzymes (for example, protein kinases, acyl transferases, and methyl 
transferases). By incubating the protein display chip with the enzyme of interest and a 

25 radioactively labeled substrate, followed by washing and autoradiography, the 

location and hence the identity of those proteins that are substrates for the modifying 
enzyme may be readily determined. In addition, the use of this approach with ordered 
displays of small peptides allows the further localization of such modification sites. 

Protein display technology may be carried out using arrays of nucleic acids 
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(including RNA, but preferably DNA) unmobilized on any appropriate solid support. 
Exemplary solid supports may be made of materials such as glass (e.g., glass plates), 
silicon or silicon-glass (e.g.. microchips), or gold (e.g., gold plates). Methods for 
attaching nucleic acids to precise regions on such solid surfaces, e.g., 
5 photolithographic methods, are ^vell known in the art, and may be used to generate 
solid supports (such as DNA chips) for use in the invention. Exemplary methods for 
this purpose include, without limitation, Schena et al.. Science 270:467-470 (1995); 
Kozal et al.. Nature Medicine 2:753-759 (1996); Cheng et al.. Nucleic Acids Research 
24:380-385 (1996); Lipshutz et al., BioTechniques 19:442-447 (1995); Pease et al., 
10 Proc Natl. Acad. Sci. USA 91:5022-5026 (1994); Fodor et al.. Nature 364:555-556 
(1993); Pirrung et al., U.S. Patent No. 5,143,854; and Fodor et al.. WO 92/10092. 
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1 . A method for the selection of a desired protein, comprising the steps of: 

a) providing a population of candidate RNA molecules, each of which 
comprises a translation initiation sequence and a start codon operably linked to a 

5 candidate protein coding sequence and each of which is operably linked to a peptide 
acceptor at the 3' end of said candidate protein coding sequence; 

b) in vitro or in situ translating said candidate protein coding sequences to 
produce a population of candidate RNA-protein fusions; and 

c) selecting a desired RNA-protein fusion, thereby selecting said desired 

1 0 protein. 

2. A method for the selection of a DNA molecule which encodes a 
desired protein, comprising the steps of: 

a) providing a population of candidate RNA molecules, each of which 
comprises a translation initiation sequence and a start codon operably linked to a 

1 5 candidate protein coding sequence and each of which is operably linked to a peptide 
acceptor at the 3' end of said candidate protein coding sequence; 

b) in vitro or in siJu translating said candidate protein coding sequences to 
produce a population of candidate RNA-protein fusions; 

c) selecting a desired RNA-protein fusion; and 

20 d) generating jfrom said RNA portion-of said fusion a DNA molecule 

which encodes said desired protein. 

3. A method for the selection of a protein having an altered function 
relative to a reference protein, comprising the steps of: 

a) producing a population of candidate RNA molecules from a population 
25 of DNA templates, said candidate DNA templates each having a candidate protein 

coding sequence which differs from said reference protein coding sequence, said RNA 
molecules each comprising a translation initiation sequence and a start codon operably 
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UnKed .o .aid Candida.. p«.»in coding sequence a™. ««h being operably liniced u. . 

peptide acceptor at the 3' end; 

b) in Jdlm or in situ translating said candidate protein coding sequences to 

produce a population of candidate RNA-protein fosions; and 
5 c) selecting an RKA-protein fusion having an altered function, thereby 

selecting said protein having said altered function. 

4 A method for the selection of a DNA molecule which encodes a 
protein having an altered functionr^lativetoareferenceprotein. comprising the st^ 

10 a)producingapopulationofcandidateRNAmoleculesfromapopulation 
ofcandidateD^^A templates, said candidateDNA templates each havingacandxdate 

protein coding sequence which differs firom said reference protein coding sequence, 
said RNA molecules each comprising a translation initiation sequence and a start 
codon operably linked to said candidate protein coding sequence and each bemg 
15 operably linked to a peptide acceptor at the 3- end; 

b) in vitm or in Situ translating said candidate protein coding sequences to 
produce a population of RNA-protein fusions; 

c) selecting an RNA-protein fusion having an altered function; and 

d) generating from said RNA portion of said fusion a DNA molecule 
20 which encodes said protein having said altered function. 

5 A method for the selection of a desired RNA, comprising the steps of: 

a) providing a population of candidate RNA molecules, each of whxch 

comprises a translation initiation sequence and a start codon operably linked to a 
candidateproteincodingsequenceandeachofwhichisoperablylinkedtoapept.de 

25 acceptor at the 3' end of said candidate protein coding sequence; 

b) in Yitm or insite translating said candidate protein coding sequences to 

produce a population of candidate RNA-protein fusions; and 

c) selecting a desired RNA-protein fusion, thereby selecting said desired 
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RNA. 

6. The method of any of claims 1-5, wherein said peptide acceptor is 
puromycin. - 

7. The method of any of claims 1-5, wherein each of said candidate RNA 
5 molecules further comprises a pause sequence or further comprises a DNA or DNA 

analog sequence covalently bonded to the 3' end of said RNA molecule. 

8. The method of any of claims 1-5, wherein said population of candidate 
RNA molecules comprises at least 10" different RNA molecules. 

9. The method of any of claims 1-5, wherein said ia vitro translation 

1 0 reaction is carried out in a lysate prepared from a eukaryotic cell or portion thereof. 

10. The method of claim 9, wherein said in vitro translation reaction is 
carried out in a reticulocyte lysate. 

1 1 . The method of claim 9, wherein said in vitro translation reaction is 
carried out in a wheat germ lysate. 

15 12. The method of any of claims 1 -S, wherein said in vitro translation 

reaction is carried out in a lysate prepared from a bacterial cell or portion thereof. 

13. The method of any of claims 1-5, wherein said selection step 
comprises binding of said desired protein to an inmiobilized binding partner. 



20 



14. The method of any of claims 1-5, wherein said selection step 
comprises assaying for a functional activity of said desired protein. 
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1 5. The method of claims 2 or 4, wherein said DNA molecule is 

amplified. 

16. The method of claims 1, 3, or 5, wherein said method further 
comprises repeating step (a) through (c). 

5 17. The method of claims 2 or 4, wherein said method further comprises 

transcribing an RNA molecule from said DNA molecule and repeating steps (a) 
through (d). 

18. The method of any of claims 1-5, wherein said RNA is covalently 
bonded though an amide bond to said protein in said RNA-protein fusion. 

10 1 9. The method of any of claims 1-5, wherein said RNA is covalently 

bonded to said protein in said RNA-protein fusion, said covalent bond being resistant 
to cleavage by a ribosome. 

20. The method of any of claims 1-5, wherein, following the in vifia 
translating step, an incubation is carried out in the presence of 50-100 mM 

15 21 . The method of any of claims 1-5, wherein said RNA-protein fusion 

further comprises a nucleic acid or nucleic acid analog sequence positioned proximal 
to said peptide acceptor which increases flexibility. 

22. An RNA-protein fusion selected by any of the methods of claims 1-5. 

23. A molecule comprising a ribonucleic acid covalently bonded though 
20 an amide bond to a protein. 



24. The molecule of claim 23, wherein said protein is encoded by said 
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ribonucleic acid. 

25. The molecule of claim 23, wherein said ribonucleic acid is a 
messenger RNA. 

26. A molecule comprising a ribonucleic acid covalently bonded to a 
5 protein, said protein being entirely encoded by said ribonucleic acid. 

27. The molecule of claim 26, wherein said ribonucleic acid is messenger 

RNA. 

28. A molecule comprising a ribonucleic acid covalently bonded to a 
protein, said covalent bond being resistant to cleavage by a ribosome. 

1 0 29. The molecule of claim 28, wherein said ribonucleic acid is messenger 

RNA. 

30. A ribonucleic acid comprising a translation initiation sequence and a 
start codon operably linked to a candidate protein coding sequence, said ribonucleic 
acid being covalently bonded to a peptide acceptor at the 3* end of said candidate 

1 5 protein coding sequence. 

3 1 . A method for the selection of a desired protein or desired RNA, 
comprising: 

(a) providing a population of candidate RNA molecules, each of which 
comprises a translation initiation sequence and a start codon operably linked to a 

20 candidate protein coding sequence and each of which is operably linked to a peptide 
acceptor at the 3' end of the candidate protein coding sequence; 

(b) in vitro or in situ translating the candidate protein coding sequences to 
produce a population of candidate RNA-protein fusions; 
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(c) contacting said population of RNA-protein fusions with a binding 
partner specific for either the RNA portion or the protein portion of said RNA-protein 
fusion under conditions which substantially separate said binding partner-RNA- 
protein fusion complex from unbound members of said population; 

5 (d) releasing said bound RNA-protein fusions from said complex; and 

(e) contacting said population of RNA-protein fusions from step (d) with a 
binding partner specific for the protein portion of said desired RNA-protein fusion 
under conditions which substantially separate said binding partner-RNA-piotein 
fusion complex from unbound members of said population, thereby selecting the 

1 0 desired protein and the desired RNA. 

32. The method of claim 31, wherem said method fiirther comprises 
repeating steps (a) through (e). 

33. The method of claim 31, wherein said peptide acceptor is puromycin. 

34. The method of claim 3 1 , wherein each of said candidate RNA 

15 molecules further includes a pause sequence or fiirther comprises a DNA or DNA 
analog sequence covalently bonded to the 3' end of said RNA molecule. 

35. The method of claim 3 1 . wherein said population of candidate RNA 
molecules includes at least 10'^ different RNA mokcules. 

36. The method of claim 31, wherein said in JOttQ translation reaction is 
20 carried out in a lysate prepared from a eukaryotic cell or portion thereof. 

37. The method of claim 36, wherein said in yilm translation reaction is 
carried out in a reticulocyte lysate or wheat germ lysate. 

38. The method of claim 31. wherein said in vitts translation reaction is 
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carried out in an extract prepared from a prokaryotic cell or portion thereof. 

39. The method of claim 31, wherein at least one of said binding partners 
is immobilized on a solid support. 

40. The method of claim 3 1 , wherem, following the in vifro translating 
step, an incubation is carried out in the presence of 50-100 mM Mg^*. 

41 . The method of claim 31, wherein said RNA-protein ftision further 
comprises a nucleic acid or nucleic acid analog sequence positioned proximal to said 
peptide acceptor which increases flexibility. 

42. A microchip comprising an array of immobilized single-stranded 
nucleic acids, said nucleic acids being hybridized to RNA-protein fusions. 

43. The microchip of claim 42, wherem said protein is encoded by said 

RNA. 



wo 98/31700 



PCTAJS98/00807 



1/16 



-ORF 



Fig. 1 A T7 IS 



3' CONSTANT 



+ DNASPUNT 

PUROMYCIN-TETHERED OUGO IS LIGATED I 
I— 40J TO mRNA (GENERATED FROM ABOVE T DNAUGASE 
Fig. 1D< CONSTRUCT) IN THE PRESENCE of 
' A SPUNT AND DNAUGASE 



NOTE: FOR SHORT ORFS. THIS 
WHOLE TEMPLATE CAN 
L BEMADESYNTHETICAUy 



Fig. ^C< 




IN VITRO TRANSUTION PROCEEDS NORMALLY 
FRoKiE?TOTHE3"ENDOFTHErnRNA 



5" 



COVALENTLY UNKED PUROMYON ENTERS THE A 
STTE AND AnACKS PEPTIOYL WNA IN P SfTE 



RELEASE OF RNA-PROTEIN FUSION WITH 
HIGH SALT WASH OF RIBOSOME 



soBsrrruTE sheet (rule 26) 




wo 9801700 



PCT/US98/00807 



2/16 



GENERALIZED PROTOCAL FOR SELECTION EXPERIMENTS 



PUROMYCIN 
TETHERED 
OLIGO 



DNA 
TEMPLATE 




mRNA 



AMPUFICATION 
VIA PCR 
(ADD T7 PROMOTER) 

PARTIAL 
DNA 
TEMPLATE 



RNA/PURON4YCIN 
CONJUGATE 



REVERSE 
I TRANSCRIPTION 



IN VITRO 
TRANSLATION 




PROTEIN 



RNA 
o — — 

RNA PROTEIN-FUSION 




ISOLATION VIA IMMOBIUZED SELECTION MOTIF 

-PROTEIN -CARBOHYDRATE 
-RNA -SMALL MOLECULE 
- -DNA 

-TRANSITION STATE ANALOGS ETC.. 



Fig. 2 



SUBSTITUTE SHEET (RULE 26) 



PCTAJS98AMI807 



3/16 




SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



PCT/US98/00807 



4/16 



HO-H2C 

.0, 




PROTECT AMINO GROUP 
WITH (CF3CO)2 




HO-H2C 



H] [H 

H— N OH 



=0 



H3N+ 



OCHa 



PHN 

P^PROTECTING GROUP 



PUROMYCIN 



.CHa 



PROTECT 5 0H 
WITH TRITYL 
CHLORIDE 



CHs^ ^CHs 
N 



OCH3 



DMTO 



HoC N^^N-^ 




DMTO-H2C N 

^O. 



PHN 



-|H n n CPG m — [h 

Qx^\^\0'^'^ LINK 2'OH TO SOLID h— fvl OH 




SUPPORT ALKYL AMINE 
CPG WITH 
1) DCC/SUCCINIC 
OCH3 ANHYDRIDE PHN 



2) p-NITROPHENOL 




OCHa 



PROTECTED, CPG PUROMYCIN 



USE AS SOLID SUPPORT IN 
AUTOMATED DNA SYNTHESIZER 
-CLEAVAGE YIELDS 3' PUROMYCIN 
TETHERED OUGONUCLEOTIDE 



Fig^ 



SUBSTITUTE SHEET (RULE 26) 



FCT/US98/00807 

WO 98/31700 



5/16 




Fig. 5 



SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



6/16 



PCTA;S98/D0807 



Lane 1.2 3 4 5 6 

Mg+2[mMJ 32P 6 9 12 15 18 



43-P 



t 




i-*tRNA 



35S met 
labeled 
product 



35S met 
labeled 
product 



mm 



H 



NH.OH 



ig 



Fig. 6B 



"CUG" 



V'^'Sl. Goug Ohlor. i:5U0uM.| 




143-P 



43-P 
[CUG] 




tRNA 



Fig. 6C 



F-ig. 6D 



SUBSTITUTE SHEET (RULE 26) 



^ PCT/US98rtJ0807 
WO 98/31700 

7/16 



Linker (dA)n n=27 ^=24 r£21 [1=18 
Mg+2[mMl 9 18 9 18 9 18 9 18 

tRNA*- iP^ 



met 
labeled 
Droduet 



Fig. 6E: 



43- P * 
■25-P 




ig. 6F 



43-P . . -^.i 

■*. r<i.p.v..' 



43-P 




•13-P 



43-P 



-*30-P 



Fig 6H 



SUBSTITUTE SHEET (RULE 26) 



PCTAJS98/00807 

8/16 



AUG 



MYC 



LINKER 



M EQKLISEEDLN 



LP77 



Fig. 7A 



dAzydCdC 
"30-P' 



AUG 



MYC 



M AEEQKLISEEDLLRKRR 
EQLKHKLEQLRNSCA 



LP154I 

Fig. 7B 



AUG 



M 



(NNe/c)27QLRNSCA 



pooim\ 
Fig. 7C 



LINKER. 



dA27dCdC 
"30-P" 



LINKER 




dA27dCdC 
"30-P" 



SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



o 

8 

Q. 

CM 
CO 



9/16 
no RNase 



Retic 



Wheat 



CL 

CO CL 
-J 



to 



^ ^ I 

5 i 2: 



PCTAJS98/00807 



+ RNase A 



Retic 



Wheat 



9r f: 2 
CO, a. CL 
-J ^ 



DL ^ 

CO Q- a- 
^ -I 



LP154I 
LP771 



4 



m 



43-P? 
30-.P-i 



Fig. 8 



Effects of Rfsl#e A ancJ Proteinase K on ^usion 

Lane# 1 



pmoles LP1!54 



LP 154 Fusion 



30-P Fusion i 



2 3 4 5 6 7 8 9 .1,0 

.CIT25 dt25 tRNaseA 
.isolated .4-RtMaseA :*;Protj<_ 



5 10-10 5 10-10 v!5 lO-r^lO 



30-p 

Fig. 9 

SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



10/16 



PCTAJS98/00807 



Lane# 1 2 3 4 5 6 7 8 

IP with c-myc mAb 

. r : . ." -PSS; DB PBS P8S; ?be PBS 
controls IDS TDS 




Fif . 10 



SUBSTITUTE SHEET (RULE 26) 



\VO9S/31700 



PCT/US98y00807 



Lane* 



11/16 

1 2 3 4 5 6 7 
RNA124I r LP154 I 



5 10 20 5 10 20 pmoles template 





30-P Fusion 



0.43% 



4 llf - 



0.70%.. 



0.2 V/o 

— . ) 

A 



CPM 
UNMODIFIED 
30-P 



3 10'' 



2 lO'^ 



1 10*' 



0 Kr 



30-P 
FUSION 



- no' 

I -soot) 
I 

■-6000 
•4000 
2000 



CPM 30 P 
FUSION 



0 



li 10 15 20 2b 



PICOMOLirS LP 154 _ 

I^OX to'-'MOl.ECUl,F.S FUSION/ml rRANSLAIlONj 



Fig. 12 



SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 PCT/US98/0OSO7 

12/16 




SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



13/16 



PCT/US98/D0807 



CO 
lO 

CO 
CM 



DO 
IT) 

d) 



<D 
ID 

CO 
CM 



J 



< 

LO 
1 — 

Ll. 



i i k kkk k 

CO O LT) CD -r-- a> 

a> CQCsJCM T- 



SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 



14/16 



PCT/US98/00807 



IN VITRO 5 
TRANSUATIQN^ 



5* 



PGR 




UNSEllECTED: 
MIXTURE 



SELECTED 
■ MIXTURE 



r~ 



myc TEMPLATE 



* ■ ! 



POOL 



Fig. 16A 



Pool*- 



12 3 4 

mm 



12 3.4 



myc-||i» 

Fig. 16B 



SUBSTITUTE SHEET (RULE 26) 



WO 98/31700 



PCTAJS98/00807 



15/16 



1 2 3 4 5 6 7 8 9 10 11 12 



m — A: 



SUBSTITUTE SHEET (RULE 26) 



wo 98/31700 PCT/US98/00807 

16/16 





Fig. 19 

SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/00807 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :C07H 21/02; C07K 4/00, 14/00; C12Q 1/68 

US CL :435/6; 530/300, 350; 536/23.1 
According to International Patent Classification (IPC) or to both national classification and IPC 

R FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 435/6; 530/300, 350; 536/23.1 

Documentation searched other than minimum documenUtion to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of dau base and, where practicable, search terms used) 
Please See Extra Sheet. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



x,p 

Y,P 



ROBERTS et al. RNA-peptide fusions for the in vitro selection of 
peptides and proteins. Proc. Natl. Acad. Sci. USA November 1997, 
Vol. 94, No. 23, pages 12297-12302, see entire document. 



1,2,3-7,9- 
12,15,18- 
24,26,28,30-39,41 

3,4,8,13,14,16,17 
,25,27,29,40,42,4 
3 



[ [ Further documents are listed in the continuation of Box C. | | See patent family annex. 



* special caCegoriet of cited doctimenu: 

* A* document defining the general state of the ait whidi is not considered 

to be of particular relevance 

"E* earlier document published on or mfitt the international filing date 

*L* document which may throw doubu on priority clsini(s) or which is 

cited to establish the publication date of another ciution or other 
special reason (as specified) 

'O' document referring to an oral disclosure, use. exhibition or other 

means 

"P* document published prior to the international filing date but later than 



later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

doctmtent of particular relevance; tlie claimed invention cannot bo 
considered funrel or cannai be considered to involve an inventive step 
when the document b taken alone 

document of particular relevance; the claimed invention cannot be 
considered to inv^e an inventive step when the doctnnent is 
combined with one or more other such documents, such combination 
being obvious to s person skilled in the ait 

document member of the same patent family 



Date of the actual completion of the international search 
06 MAY 1998 


Date of mailing of the international search report 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C 20231 
FacsimUc No. (703) 305-3230 


Telephone No. (703) 308-0196 / 



Form PCT/IS/W210 (second .shcct)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/00807 



B. HELDS SEARCHED 

Electronic data bases consulted (Name of dato base and where practicable terms used): 

STN: MEDUNB. BIOSIS. EMBASE, CAPLUS. WPIDS. JAPIO. PATOSEP, PATOSWO 
APS 

Search Terms: RNA, protein, peptide, polypeptide, fusion, hybrid, chimera, linked, invitro selection, in vitro 
evolution, directed evolution, library, population 



Foitn PCT/lSA/210 (extra sheet)(July 1992)* 



THIS PAGE BLANK (ysm^ 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 
QIbLURRED or ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS • - v,'. . 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



THIS PAGE BLANK (Msm> 



